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MOLECULES ASSOCIATED WITH THE HUMAN FUSED GENE 



Technical field 

The present invention relates to novel molecules, such as proteins, polypeptides and 
nucleotides, involved in the transduction of signals in the hedgehog-patched (HH- 
PTC) pathway, which takes place during the development of the cells of a human 
5 body. The invention also relates to certain advantageous uses of the molecules 
according to the invention in diagnosis and therapy. 

Background 

In the study of the development of cells, fruit flies have extensively been used as a 
10 model, as they are less complex than mammalian cells. 

Pattern formation takes place through a series of logical steps, reiterated many times 
during the development of an organism. Viewed from a broader evolutionary per- 
spective, across species, the same sort of reiterative pattern formations are seen. The 

1 5 central dogma of pattern formation has been described (Lawrence and Struhl, 1 996). 
Three interlocking and overlapping steps are defined. Firstly, positional information 
in the form of morphogen gradients allocate cells into non-overlapping sets, each set 
founding a compartment. Secondly, each of these compartments acquire a genetic 
address, as a result of the function of active "selector" genes, that specify cell fate 

20 within a compartment and also instruct cells and their descendents how to commu- 
nicate with cells in neighboring compartments. The third step involves interactions 
between cells in adjacent compartments, initiating new morphogen gradients, which 
directly organize the pattern. 

25 Taking these steps in greater detail, one finds the first step in patterning to be the 
definition of sets of cells in each primordium. Cells are allocated according to their 
positions with respect to both dorsoventral and anterior/posterior axes by morpho- 
gen gradients. Allocation of cells in the dorsoventral axis constitutes the germ lay- 
ers, such as mesoderm or neurectoderm. 
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In segmentation, the second step (the specification of cell fate in each compartment) 
is carried out by the gene engrailed and elements of the bithorax complex. Engrai- 
led defmes anterior and posterior compartments both in segmentation and in limb 
specification. 

5 

The third step in pattern formation, secretion of morphogens, functions to differenti- 
ate patterns within compartments (and thereby establish segment polarity). Initially, 
all cells within a compartment are equipotent, but they become diversified to fom 
pattern. Pattern formation depends on gradients of morphogens, gradients initiated 

10 along compartment boundaries. Such gradients are established by a short-range sig- 
nal induced in all the cells of the compartment in which the above mentioned selec- 
tor gene engrailed is active. For segment polarity, this signal is Hedgehog. In the 
adjacent compartment the selector gene is inactive, ensuring that the cells are sensi- 
tive to the signal. The Hedgehog signal range is probably only a few rows of cells 

1 5 wide; responding cells become a linear source of a long-range morphogen, that dif- 
fuses outward in all directions. There are three known Hedgehogs, Sonic (SHH), In- 
dian (IHH) and Desert (DHH). The proteins they encode can substitute each for 
each other, but in wildtype animals, their distinct distributions result in unique acti- 
vities. SHH controls the polarity of limb growth, directs the development of neurons 

20 in the ventral neural tube and patterns somities. IHH controls endochondral bone 
development and DHH is necessary for spermiogenesis. Vertebrate hedgehog genes 
are expressed in many other tissues, including the peripheral nervous system, brain, 
lung, liver, kidney, tooth primordia, genitalia and hindgut and foregut endoderm. 

25 Thus, segment polarity genes have been identified in flies as mutations, which 

change the pattern of structures of the body segments. Mutations in these genes cau- 
se animals to develop the changed patterns on the surfaces of body segments, the 
changes affecting the pattern along the head to tail axis. For example, mutations in 
the gene patched cause each body segment to develop without the normal structures 

30 in the center of each segment. Instead there is a mirror image of the pattern normally 
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found in the anterior segment. Thus, cells in the center of the segment make the 
wrong structures, and point them in the wrong direction with reference to the over 
all head-to-tail polarity of the animal. 

5 About sixteen genes in the class are known. The encoded proteins include kinases, 
transcription factors, a cell junction protein, two secreted proteins called wingless 
(WG) and the above mentioned Hedgehog (HH), a single transmembrane protein 
called patched (PTC) and some novel proteins not related to any known protein. All 
of these proteins are beleived to work together in signaling pathways that inform 
10 cells about their neighbors in order to set cell fates and polarities. 

PTC has been proposed as a receptor for HH protein based on genetic experiments 
in flies. A model for the relationship is that PTC acts through a largely unknown 
pathway to inactivate both its own transcription and the transcription of the wingless 
1 5 segment polarity gene. This model proposes that HH protein, secreted from adjacent 
cells, binds to the PTC receptor, inactivates it and thereby prevents PTC from tur- 
ning off its own transcription or that of wingless. A number of experiments have 
shown coordinate events between PTC and HH. 

20 WO 96/1 1260 discloses the isolation of patched genes and the use of the PTC pro- 
tein to identify ligands, other than the established ligand Hedgehog, that bind there- 
to. However, even though it is briefly suggested that drugs may be identified which 
can prevent the transduction of signals by the PTC protein, there are no teachings as 
regards how such signals are transduced. 

25 

In order to elucidate how the Hedgehog elecits signal transduction, a large complex 
containing the kinesin-related protein costal 2 has been proposed (Robbins et al; 
Cell : July 25, 1997, 90(2), p. 225-34). Said complex includes the products of at least 
three genes: fused (a protein-serine/threonine kinase), cubitus interruptus (a tran- 
30 scription factor) and costal 2 (a kinesin-like protein). It is concluded that in Droso- 
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phila, the complex may facilitate signaling from HH by governing access of the cu- 
bitus interruptus protein to the nucleus. 

Therond et al have also studied signaling from Hedgehog in Drosophila (Proc. Natl. 
5 Acad. Sci. USA, April 30, 1 996, 93(9), p. 4224-8). The Drosophila gene fused (fu) 
encodes a serine/threonine-protein kinase, that genetic experiments have implicated 
in signaling initiated by hedgehog. It is proposed that the fused protein is phospho- 
rylated during the course of Drosophila embryogenesis, as a result of hedgehog acti- 
vity. The conclusion of this reference is, that a reconstruction of signaling from hed- 
10 gehog in cell culture should provide further access to the mechanisms by which the 
hedgehogs acts. 

Thus, even though the gene fused (fu) has been identified in Drosophila, it has not 
been isolated from human beings in the prior art. The prior art teachings about the 

1 5 interactions between hedgehog and patched in mammals are not at all sufficient for 
practical application in regard of human beings. Firstly, there is a need of a better 
understanding of how signals are actually elicited and transduced between the hed- 
gehog ligand and the patched receptor. Secondly, the human homologues of the ge- 
nes and proteins implicated in this pathway must also be fully sequenced, and the 

20 cDNA thereof identified, in order to enable recombinant reproduction thereof. 

However, due to substantial genetic differences between fruit flies and human 
beings and the complex nature of the human genome, the isolation of the human 
homologues to genes identified in fruit flie is in no way a straightforward task. 

25 

Summary of the invention 

The present invention fulfill the above defined desire by providing novel human 
homologues to molecules associated with the Drosophila fused gene and implicated 
in the transduction of the signals that are elicited by the interaction between the pat- 
30 ched receptor (sometimes denoted the NBCCS gene) and any one of the hedgehog 
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ligands. Thus, the present gene has according to the invention been localised to hu- 
man chromosome 15q22-24. The molecules according to the invention are nucleic 
acids as well as polypeptides and proteins encoded thereof, which are useful within 
several fields, including the study of different conditions, such as cancer and deve- 
5 lopment of cancer therapies, the regulation of gene transcription, studies of embryo- 
nic development, tissue repair etc. Two specific syndromes have been mapped to the 
15q22-24 genomic region, namely "Bardet-Biedl Syndrome" and "Cerebral Gigan- 
tism/Sotus Syndrome" (15q22 or 5q35). The latter is a cancer predisposition 
syndrome also involving craniofacial defects and growth disturbances making the 
10 FUH an interesting candidate gene. The great importance of the present pathway al- 
so makes the FUH a potential "oncogene" or "tumor suppressor gene". 

The present invention provides an important basic understanding of a signaling pat- 
hway that is central to normal development and often disrupted in disease. The new 
1 5 knowledge will also be of great value when considering new therapeutic strategies 
involving modification of SHH-PTCH signaling. Potential areas include tissue re- 
pair/wound healing (brain, bone, cartilage, skin), neurodegenerative disease, testi- 
cular function and cancer. 

20 Detailed description of the invention 

Accordingly, in a first aspect, the present invention relates to a protein molecule ca- 
pable of being involved in eliciting intracellular signaling in the human HH-PTC 
pathway, preferably the SHH-PTC pathway. The protein according to the invention 
exhibits a substantial similarity with the sequence disclosed in SEQ ID NO 2 of the 

25 Sequence Listing shown below. In a particular embodiment, the protein according to 
the invention is comprised of about 80% of said sequence and in a specific embodi- 
ment, it is comprised of substantially all of said amino acid sequence. However, it 
should probably include at least about 60% of said sequence in order to exhibit the 
advantageous properties as disclosed in the present application. As someone of skill 

30 in this area easily realises, the percentages given herein will depend on possible 
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further added fragments or sequence parts, as a protein exhibiting 80% of the 
sequence of SEQ ID NO 2, to which a further fragment has been added, naturally in 
itself will include a smaller percentage of the present sequence. Thus, it is to be un- 
derstood that any sequence including the herein given percentages of SEQ ID NO 2 
5 as disclosed is within the scope of the invention. Analogues of, and functional equi- 
valents to, the proteins according to the invention are also encompassed by the pre- 
sent invention. Thus, the protein according to the invention is an isolated human fu- 
sed protein, (FUH). 

1 0 According to the present invention, it has been found that the human homologue of 
Drosophila fused is an intracellular transducer of the signal elicited at the cell mem- 
brane by interaction of ligands, such as sonic hedgehog (SHH), indian hedgehog 
(IHH) and desert hedgehog (DHH), with the receptor human homologue of Droso- 
phila patched (PTC). This interaction leads to initiation of a signaling pathway by 

1 5 allowing PTC or its coreceptor, human homologue of Drosophila smoothened 

(SMOH), to activate intracellular signal transducers. As mentioned above, in Droso- 
phila c€i\s, fused interacts with the microtubule associated protein costal-2 (cos-2) 
and the transcription factor cubitus interruptus (ci). The latter has human homolo- 
gues denoted GLI 1-3. 

20 

Thus, in a second aspect, the present invention also relates to polypeptides compri- 
sed of suitable fragments or parts of the amino acid sequence of the above described 
protein. Thus, such a polypeptide is also associated with the HH-PTC pathway, such 
as SHH-PTC, and is comprised of a subsequence of up to about 20, such as 8-15, 

25 preferably 9-12 and most preferred 10 contiguous amino acids of the sequence 

disclosed in SEQ ID NO 2 or conservative substitutions of said sequence. The poly- 
peptide according to the invention may be encoded by a nucleic acid amplified from 
genomic DNA or RNA using appropriate primers which the skilled man in this field 
easily chooses. In a specific embodiment, the polypeptide according to the invention 

30 is presented as an antigen, which elicits the production of an antibody which speci- 
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fically binds to a polypeptide encoded by a nucleic acid according to SEQ ID NO 1, 
and said polypeptide does not bind to antisera raised against a polypeptide encoded 
by a nucleic acid sequence according to SEQ ID NO 1, which has been fully immu- 
nosorbed with a polypeptide encoded by a sequence of SEQ ID NO 2. 

5 

A third aspect of the present invention is a nucleic acid encoding any one of the 
above defined proteins or polypeptides. In a preferred embodiment, the nucleic acid 
according to the invention is such a DNA sequence as disclosed in SEQ ID NO 1 . 
Thus, according to the present invention, for the first time, a human homologue of 

1 0 the Drosophila fused gene has been isolated, the cDNA thereof has been established, 
sequenced and cloned. The complete cDNA sequence thereof is disclosed in SEQ 
ID NO 1 . The present invention also encompasses nucleotides that hybridises to part 
or all. of the sequence of SEQ ID NO 1 as well as to nucleotides hybridising to the 
DNA encoding the proteins and polypeptides according to the invention. Such nuc- 

1 5 leotides may e.g. be RNA sequences. 

The human homologue according to the invention exhibits an amino acid similarity 
with Drosophila fused of 54%. Within the kinase domain, the similarity increases to 
59,5%. Further, the present human homologue of fused exhibits an amino acid iden- 
20 tity with Drosophila fused of 33,5%. Within the kinase domain, the identity increa- 
ses to 37%. 

In a fourth aspect, the present invention also relates to a vector comprising a nucleic 
acid according to the invention, such as part of, or all of, the DNA sequence disclo- 
25 sed in SEQ ID NO 1. The vector preferably include the above described nucleic acid 
operably linked, i.e. under the control of, a promoter; either constitutive or inducib- 
le. The vector can also include suitable initiation and termination codons. 
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V 



In addition, the invention also relates to an expression cassette comprising any one 
of the nucleic acids according to the invention. Accordingly, yet another aspect of 
the invention is a cell comprising an expression cassette according to the invention. 
In one further aspect, the invention relates to an antibody which specifically binds to 
5 such a polypeptide as defined above, which preferably comprises at least about 1 0, 
more preferably at least 20, 40 or 50 and most preferably at least 1 00 or 200, or even 
400, amino acids. Alternatively, the antibody binds to a protein encoded by essenti- 
ally all of the sequence as disclosed in SEQ ID NO 1 . The antibody is polyclonal or 
monoclonal, preferably, it is a monoclonal antibody. It can be humanized or human. 

10 

One specific embodiment of the present invention is a kinase-activating substance, 
which is capable of activating a protein according to the invention. Alternatively, 
the invention relates to a kinase-inhibiting substance. Such kinase-regulating sub- 
stances may be used as drugs. Pharmaceutical compositions to this end comprise a 
1 5 kinase-activator or kinase-inhibitor capable of influencing the protein according to 
the invention together with a pharmaceutically acceptable carrier. 

The present invention also relates to a cell, such as a recombinant cell, e.g. hybri- 
domas or triomas, expressing such an antibody as defined above. Also encompassed 
20 by the invention is the use of such an antibody as a medicament or in the manufactu- 
re of a specific drug or therapy. 

Thus, another aspect of the invention is a pharmaceutical composition which comp- 
rises a molecule selected from the group consisting of a vector encoding a polypep- 
25 tide according to the invention or a subsequence thereof, a polypeptide according to 
the invention or a subsequence thereof and an antibody raised against the present 
polypeptides together with a pharmaceutically acceptable carrier. 



Yet another aspect of the invention is a kit for detection of a human fused gene, 
30 which kit comprises a nucleic acid sequence capable of hybridising to an essential 
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part of the sequence disclosed in SEQ ID NO 1, preferably under stringent condi- 
tions. The invention also relates to a kit for detection of a protein encoded by a hu- 
man fused gene sequenced according to the present invention, comprising a contai- 
ner containing an antibody as defined above. 

5 

In a last aspect, the invention also relates to methods for detecting any deviations 
from a normal human fused gene. The method includes the steps of a) providing a 
biological sample of the organism and b) detecting a human fused gene or gene pro- 
duct in the sample. The provision of a biological sample and detection methods will 
1 0 be described below. 

The invention also relates to diagnostic and prognostic methods, wherein mutations 
in pathway components are detected, as well as to kits for performing such methods. 

1 5 Finally, the present invention also relates to methods of treatment, wherein the abo- 
ve defined molecules are used. Conditions that may be contemplated include can- 
cers, such as basal cell cancer of the skin, medulloblastomas, trichoepitheliomas and 
breast cancer, conditions caused by defects in brain development, lung development, 
tooth and hair development and sperm formation, and tissues repair. The present 

20 molecules are also useful in gene therapy. The methods according to the invention 
can involve transfecting cells of a mammal with a vector expressing a polypeptide 
or antibody according to the invention. The transfection can be in vivo or ex vivo. Ex 
vivo transfection is suitably followed by re-infusing the ceils into the organism. Ot- 
her methods involve administering to the mammal, e.g. a human, of a therapeuti- 

25 cally effective dose of a composition comprising a polypeptide according to the in- 
vention and a pharmacological excipient and/or carrier. 

Definitions 

The term "antibody" refers to a polypeptide substantially encoded by an immunog- 
30 lobulin gene or immunoglobulin genes, or fragments thereof which specifically bind 
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and recognize an analyte (antigen). The recognized immunoglobulin genes include 
the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as 
well as the myriad immunoglobulin variable region genes. Light chains are classifi- 
ed as either kappa or lambda. Heavy chains are classified as gamma, mu alpha, del- 
5 ta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD 
and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having 
one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
1 0 of each chain defines a variable region of about 1 00 to 1 1 0 or more amino acids 
primarily responsible for antigen recognition. The terms variable light chain (V L ) 
. and variable haeavy chain (V H ) refer to these light and heavy chains, respectively. 

Antibodies exist e.g., as intact immunoglobulins or as a number of well characteri- 
1 5 zed fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produ- 
ce F(ab)' 2 , a dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfi- 
de bond. The F(ab)' 2 may be reduced under mild conditions to break the disulfide 
linkage in the hinge region, thereby converting the F(ab)' 2 dimer into an Fab' mo- 
20 nomer. The Fab' monomer is essentially a Fab with part of the hinge region (see, 
Fundamental Immunology, Third Edition, W.E. Paul, ed., Raven Press, N.Y. 1993). 
While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de 
novo either chemically or by utilizing recombinant DNA methodology. Thus, the 
25 term antibody, as used herein, also includes antibody fragments either produced by 
the modification of whole antibodies or those synthesized de novo using recombi- 
nant DNA methodologies (e.g., single chain Fv). 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
30 portion thereof, is altered, replaced or exchanged so that the antigen binding site 
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(variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new proper- 
ties to the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, 
etc.; or (b) the variable region, or a portion thereof, is altered, replaced or exchanged 
5 with a variable region having a different or altered antigen specificity. 

The term "immunoassay" is an assay that utilizes an antibody to specifically bind an 
analyte. The immunoassay is characterized by the use of specific binding properties 
of a particular antibody to isolate, target, and/or quantify the analyte. 

10 

The terms "isolated" "purified" or "biologically pure" refer to material which is sub- 
stantially or essentially free from components which normally accompany it as 
found in its native state. 

1 5 The term "nucleic acid" refers to a deoxyribonucleotide or ribonucleotide polymer 
in either single- or double-stranded form, and unless otherwise limited, encompasses 
known analogs of natural nucleotides that can function in a similar manner as natu- 
rally occurring nucleotides. 



20 The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical analogue of a cor- 
responding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers. 

25 

A "label" is a composition detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include 32 P, fluore- 
scent dyes, electron-dense reagents, enzymes (e.g., as commonly used in a ELISA), 
biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal antibo- 
30 dies are available (e.g., the peptide of SEQ ID NO 1 can be made detectable, e.g., by 
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incorporating a radio-label into the peptide, and used to detect antibodies specifi- 
cally reactive with the peptide). 

As used herein a "nucleic acid probe" is defined as a nucleic acid capable of binding 
to a target nucleic acid of complementary sequence through one or more types of 
chemical bonds, usually through complementary base pairing, usually through hyd- 
rogen bond formation. As used herein, a probe may include natural (i.e. A, G, C, U 
or T) or modified bases (7-deazaguanosine, inosine, etc.) In addition, the bases in a 
probe may be joined by a linkage other than a phosphodiester bond, so long as it do- 
es not interfere with hybridization. Thus, for example, probes may be peptide nuc- 
leic acids in which the constituent bases are joined by peptide bonds rather that 
phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe 
sequence depending upon the stringency of the hybridization conditions. The probes 
are preferably directly labeled as with isotopes, chromophores, lumiphore, chromo- 
gens, or indirectly labeled such as with biotin to which a streptavidin complex may 
later bind. By assaying for the presence or absence of the probe, one can detect the 
presence or absence of the select sequence or subsequence. 

A "labeled nucleic acid probe" is a nucleic acid probe that is bound, either cova- 
lently, through a linker, or through ionic, van der Waals or hydrogen bonds to a la- 
bel such that the presence of the probe may be detected by detecting the presence of 
the label bound to the probe. 

The term "target nucleic acid" refers to a nucleic acid (often derived from a biologi- 
cal sample), to which a nucelic acid probe is designed to specifically hybridize. It is 
either the presence or absence of the target nucleic acid that is to be detected, or the 
amount of the target nucleic acid that is to be quantified. The target nucleic acid has 
a sequence that is complementary to the nucleic acid sequence of the corresponding 
probe directed to the target. The term target nucleic acid may refer to the specific 
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subsequence of a larger nucleic acid to which the probe is directed or to the overall 
sequence (e.g., gene or mRNA) whose expression level it is desired to detect. The 
difference in usage will be apparent from the context. 

"Subsequence" refers to a sequence of nucleic acids or amino acids that comprise a 
5 part of a longer sequence of nucleic acids or amino acids (e.g., polypeptide), respec- 
tively. 

The term "recombinant" when used with reference to a cell, or nucleic acid, or vec- 
tor, indicates that the cell, or nucleic acid, or vector, has been modified by the intro- 
1 0 duction of a heterologous nucleic acid or the alteration of a native nucleic acid, or 
that the cell is derived from a cell so modified. Thus, for example, recombinant cells 
express genes that are not found within the native (non-recombinant) form of the 
cell or express native genes that are otherwise abnormally expressed, under expres- 
sed or not expressed at all. 

15 

The term "identical" in the context of two nucleic acids or polypeptide sequences 
refers to the residues in the two sequences which are the same when aligned for 
maximum correspondence. Optimal alignment of sequences for comparison can be 
conducted, e.g., by the local homology algorithm of Smith and Waterman ( 1 98 1 ) 

20 Adv. Appl. Math. 2: 482, by the homology alignment algorithm of Needleman and 
Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson 
and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by computerized imple- 
mentations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wi- 
sconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 

25 Madison, WI) or by inspection. 

An additional algorithm that is suitable for determining sequence similarity is the 
BLAST algorithm, which is described in Altschul et al. ( 1 990) J. Mol. Biol. 215: 
403-410. Software for performing BLAST analyses is publicly available through the 
30 National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
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This algorithm involves first identifying high scoring sequence pair (HSPs) by iden- 
tifying short words of length W in the query sequence that either match or satisfy 
some positive-valued threshold score T when aligned with a word of the same 
length in a database sequence. T is referred to as the neighborhood word score 

5 threshold (Altschul et al, supra.) These initial neighborhood word hits act as seeds 
for initiating searches to find longer HSPs containing them. The word hits are ex- 
tended in both directions along each sequence for as fas as the cumulative alignment 
score can be increased. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achie- 

1 0 ved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 
reached. The BLAST algorithm parameters W, T and X determine the sensitivity 
and speed of the alignment. The BLAST program uses as defaults a word length 
(W) of 1 1, the BLOSUM62 scoring matrix {see Henikoff and Henikoff (1992) Proc. 

1 5 Natl. Acad. Sci. USA 89: 1 09 1 5- 1 09 1 9) alignments (B) of 50, expectation (E) of 1 0, 
M=5, N=4, and a comparison of both strands. 

The BLAST algorithm performs a statistical analysis of the similarity between two 
sequences; see e.g., Karlin and Altschul (1993) Proc. Nat 7 Acad. Sci. USA 90: 

20 5873-5787. One measure of similarity provided by the BLAST algorithm is the 

smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by 
chance. For example, a nucleic acid is considered similar to a fused gene or cDNA if 
the smallest sum probability in a comparison of the test nucleic acid to a fused nuc- 

25 leic acid is less than about 1 , preferably less than about 0.1, more preferably less 
than about 0.01, and most preferably less than about 0.001 . 

The term "substantial identity" or "substantial similarity" in the context of a poly- 
peptide indicates that a polypeptides comprises a sequence with at least 70% 
30 sequence identity to a reference sequence, or preferably 80%, or more preferably 
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85% sequence identity to the reference sequence, or most preferably 90% identity 
over a comparison window of about 10-20 amino acid residues. An indication that 
two polypeptide sequences are substantially identical is that one peptide is immu- 
nologically reactive with antibodies raised against the second peptide. Thus, a poly- 
5 peptide is substantially identical to a second polypeptide, for example, where the 
two peptides differ only by a conservative substitution. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide which the first nucleic acid encodes is immunologically cross reactive 
10 with the polypeptide encoded by the second nucleic acid. 

Another indication that two nucleic acid sequences are substantially identical is that 
the two molecules hybridize to each other under stringent conditions. 

1 5 "Bind(s) substantially" refers to complementary hybridization between a probe 
nucleic acid and a target nucleic acid and embraces minor mismatches that can be 
accomodated by reducing the stringency of the hybridization media to achieve the 
desired detection of the target polynucleotide sequence. 

20 The phrase "hybridizing specifically to", refers to the binding, duplexing, or hybri- 
dizing of a molecule only to a particular nucleotide sequence under stringent condi- 
tions when that sequence is present in a complex mixture (e.g., total cellular) DNA 
or RNA. The term "stringent conditions" refers to conditions under which a probe 
will hybridize to its target subsequence, but to no other sequences. Stringent condi- 

25 tions are sequence-dependent and will be different in different circumstances. Long- 
er sequences hybridize specifically at higher temperatures. Generally, stringent con- 
ditions are selected to be about 5°C lower than the thermal melting point Tm for the 
specific sequence at a defined ionic strength and pH. The Tm is the temperature 
(under defined ionic strength, pH, and nucleic acid concentration) at which 50% of 

30 the probes complementary to the target sequence hybridize to the target sequence at 
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equilibrium. (As the target sequences are generally present in excess, at Tm, 50% of 
the probes are occupies at equilibrium). Typically, stringent conditions will be those 
in which the salt concentration is less than about 1 .0 M Na ion, typically about 0.01 
to 1 .0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is 
at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 
60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions may 
also be achieved with the addition of destabilizing agents such as formamide. 
The phrases "specifically binds to a protein" or "specifically immunoreactive with", 
when referring to an antibody refers to a binding reaction which is determinative of 
the presence of the protein in the presence of a heterogeneous population of proteins 
and other biologies. Thus, under designated immunoassay conditions, the specified 
antibodies bind preferentially to a particular protein and do not bind in a significant 
amount to other proteins present in the sample. Specific binding to a protein under 
such conditions requires an antibody that is selected for its specificity for a particu- 
lar protein. A variety of immunoassay formats may be used to select antibodies spe- 
cifically immunoreactive with a particular protein. For example, solid-phase ELISA 
immunoassays are routinely used to select monoclonal antibodies specifically im- 
munoreactive with a protein. See Harlow and Lane (1988) Antibodies, A Laboratory 
Manual, Cold Spring Harbour Publications, New York, for a description of immu- 
noassay formats and conditions that can be used to determine specific immunoreac- 
tivity. 

A "conservative substitution", when describing a protein, refers to a change in the 
amino acid composition of the protein that does not substantially alter the protein's 
activity. Thus, "conservatively modified variations" of a particular amino acid 
sequence refers to amino acid substitutions of those amino acids that are not critical 
for protein activity or substitution of amino acids with other amino acids having si- 
milar properties (e.g:, acidic, basic-; positively or negatively charged, polar or non- 
polar, etc.) such that the substitutions of even critical amino acids do not substanti- 
ally alter activity. Conservative substitution tables providing functionally similar 
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amino acids are well known in the art. The following six groups each contain amino 
acids that are conservative substitutions for one another: 

1) Alanine (A), Serine (S), Threonine (T); 

2) Aspartic acid (D), Glutamic acid (E); 
5 3) Asparagine (N), glutamine (Q); 

4) Arginine (R), Lysine (K); 

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 

6) Phenylalanine (F), Tyrosine (Y), Tryptophan ( W). 

See also, Creighton (1984) Protein, W.H. Freeman and Company. In addition, indi- 
1 0 vidual substitutions, deletions or additions which alter, add or delete a single amino 
acid or a small percentage of amino acids in an encoded sequence are also 
"conservatively modified variations". 

A "gene product", as used herein, refers to a nucleic acid whose presence, absence, 
1 5 quantity, or nucleic acid sequence is indicative of a presence, absence, quantity, or 
nucleic acid composition of the gene. Gene products thus include, but are not limi- 
ted to, an mRNA transcript a cDNA reverse transcribed from an mRNA, an RNA 
transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed 
from the amplified DNA or subsequences of any of these nucleic acids. Polypeptides 
20 expressed by the gene or subsequences thereof are also gene products. The particu- 
lar type of gene product will be evident from the context of the usage of the term. 

EXPERIMENTAL 
General methods 

25 I. Uses of the fused cDNA 

Among other applications, detection of defective fused gene expression is of clinical 
value. The presence of a fused gene, cDNA, mRNA, protein or subsequences of the 
gene, cDN A" Or protein in a biological sample is useful, e.g: as a marker to asses in 
vivo and/or in situ RNA transcription and/or translation, in cancer diagnostics, in 

30 prophylaxis etc. Full-length fused cDNA, individual exons, or subsequences thereof 
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are also useful as probes (particularly when labeled) for the detection of the presen- 
ce or absence and/or quantification of normal or abnormal (e.g. truncated or muta- 
ted) fused DN A or RNA in a biological sample. The labeled probes can also be us- 
eful as in karyotyping analysis as markers of iht fused gene. 
The proteins and polypeptides according to the invention are also useful as lead 
compounds in the design of organic molecules resembling the native protein, fully 
or in part. The designed molecules preferably exhibits the activity of the native one. 
Methods for protein mimicking are today used widely and the skilled man can per- 
form such a method using the present molecules as lead compounds with reference 
to the literature and well known techniques. In addition, the molecules according to 
the invention are also useful in screening for various substances. 

Further, vectors according to the invention may be used for expression of those 
proteins to provide immunogens for antibody production. Vectors encoding the pre- 
sent proteins and /or polypeptides are also useful for transforming cells in vitro or in 
vivo to express the proteins and/or polypeptides. 

Cells and/or tissues expressing the fused gene may be used to monitor expression 
levels of fused polypeptides in a wide variety of contexts. 

TT. The fused gene 

A) The human fused ge ne (FUH) 

SEQ ID NO 1 provides the cDNA sequence for the largest nucleic acid according to 
the invention, while SEQ ID NO 2 provides the corresponding protein sequence. 

B) Isolation of cDNA and/or probes. 

The nucleic acids of the present invention are cloned, or amplified by in vitro met- 
hods, such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), 
the transcription-based amplification system (TAS), the self-sustained sequence 
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replication system (SSR). A wide variety of cloning and in vitro amplification met- 
hodologies are well-known to persons of skill. Examples of these techniques and in- 
structions sufficient to direct persons of skill through many cloning exercises are 
found in Berger and Kirnrnel, Guide to Molecular Cloning Techniques, Methods in 
5 Enzymology 152 Academic Press, Inc., san Diego, CA (Berger); Sambrook et al. 
(1989) Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring harbor Press, NY, (Sambrook et al); Current Pro- 
tocols in Molecular Biology, F.M. Ausubel et al, Current Protocols, a joint venture 
between Greene Publishing Associates, Inc. And John Wiley & Sons, Inc., (1994 

10 Supplement) (Ausubel); Cashion et al., U.S. patent number 5,01 7,478; and Carr, Eu- 
ropean Patent No. 0,246,864. Examples of techniques sufficient to direct persons of 
skill through in vitro amplification methods are found in Berger, Sambrook, and Au- 
subel, as well as Mullis et al., (1 987) U.S. Patent No. 4,683,202; PCR Protocols A . 
Guide to Methods and Applications (Innis et al. Eds) Academic Press Inc. San Die- 

15 go, CA (1990) (Innis); Arnheim & Levinson (October 1 , 1990)C&EN 36-47; The 
Journal Of NIH Research (1991) 3: 81-94 (Kwoh et al (1989) Proc. Natl Acad. 
Sci. USA 86: 1 1 73; Guatelli et al (1990) Proc. Natl Acad. Sci. USA 87, 1874; Lo- 
mell etal (1989) J. Clin. Chem., 35: 1826; Landegren et al, (1988) Science, 241: 
1077-1080; Van Brunt (1990) Biotechnology, 8: 291-294; Wu and Wallace, (1989) 

20 Gene, 4: 560; and Barringer et al (1 990) Gene, 89: 1 1 7. 

In one preferred embodiment, the human fused cDNA can be isolated by routine 
cloning methods. The cDNA sequence provided in SEQ ID NO: 1 can be used to 
provide probes that specifically hybridize to the fused gene, in a genomic DNA 

25 sample, or to the fused mRNA, in a total RNA sample {e.g., in a Northern blot). On- 
ce the target fused nucleic acid is identified {e.g., in a Southern blot), it can be iso- 
lated according to standard methods known to those of skill in the art {see, e.g., 
Sambrook et al. (1989) i Molecular Cloning: A Laboratory Manual, 2nd Ed. Vols. 1- 
3, Cold Spring Harbor Laboratory; Berger and Kimmel (1987) Methods in Enzy- 

30 mology, Vol 152: Guide to Molecular Cloning Techniques, San Diego: Academic 
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press, Inc.; or Ausubel et al. (1987) Current Protocols in Molecular Biology, Gree- 
ne Publishing and Wiley-Interscience, New York). 

In another preferred embodiment, the human fused cDNA is isolated by amplifica- 
5 tion methods, such as polymerase chain reaction (PCR). 

C ) Labeling of nucleic acid probes. 

Where the fused cDNA or its subsequences are to be used as nucleic acid probes, it 
is often desirable to label the sequences with detectable labels. The labels may be 

1 0 incorporated by any of a number of means well known to those of skill in the art. 
However, in a preferred embodiment, the label is simultaneously incorporated dur- 
ing the amplification step in the preparation of the sample nucleic acids. Thus, for 
- example, polymerase chain reaction (PCR) with labeled primers or labeled nucleoti- 
des will provide a labeled amplification product. In another preferred embodiment, 

1 5 transcription amplification using a labeled nucleotide (e.g. fluorescein-labeled UTP 
and/or CTP) incorporates a label into the transcribed nucleic acids. 

Alternatively, a label may be added directly to an original nucleic acid sample (e.g., 
mRNA, polyA mRNA, cDNA, etc.) or to the amplification product after the ampli- 
20 fication is completed. Means of attaching labels to nucleic acids are well known to 
those of skill in the art and include, for example nick translation or end labeling (e.g. 
with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment 
(ligation) of a nucleic acid linker joining the sample nucleid acid to a label (e.g., a 
fiuorophore). 

25 

Detectable labels suitable for use in the present invention include any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electri- 
cal, optical or chemical means. Useful labels in the present invention include biotin 
for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabe- 
30 ads™), fluorescent dyes (e.g., fluorescein, texas red, thodamine, green fluorescent 
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protein, and the like), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes (e.g., hor- 
se radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), 
and colorimetric labels such as collodial gold or colored glass or plastic (e.g., po- 
lystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels 
5 incluse U.S. Patent Nos. 3,871 ,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. 

Means of detecting such labels are well known to those of skill in the art. Thus, for 
example, radiolabels may be detected using photographic film or scintillation coun- 
ters, fluorescent markers may be detected using a photodetector to detect emitted 
1 0 light- Enzymatic labels are typically detected by providing the enzyme with a sub- 
strate and detecting the reaction product produced by the action of the enzyme on 
the substrate, and colorimetric labels are detected by simply visualizing the colored 
label. 

15 III. Antibodies to the fused polypeptide(s). 

Antibodies are raised to the fused polypeptides of the present invention, including 
individual, ellelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, anti- 
bodies are raised to these polypeptides in either their native configurations or in 

20 non-native configurations: Anti-idiotypic antibodies can also be generated. Many 
methods of making antibodies are known to persons of skill. The following discus- 
sion is presented as a general overview of the techniques available; however, one of 
skill will recognize that many variations upon the following methods are known. 

25 A) Antibody Production 

A number of immunogens are used to produce antibodies specifically reactive with 
polypeptides according to the invention. Recombinant or synthetic polypeptides of 
' 10 amino acids in lengthy or greater/selected from amino acid sub-sequences of 
SEQ ID NO 2 are the preferred polypeptide immunogen (antigen) for the production 

30 of monoclonal or polyclonal antibodies. In one class of preferred embodiments, and 
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immunogenic peptide conjugate is also included as an immunogen. Naturally oc- 
curing polyptides are also used either in pure or impure form. 

Recombinant polypeptides are expressed in eukaryotic or prokaryotic cells (as 
5 described below) and purified using standard techniques. The polypeptide, or a 
synthetic version thereof, is then injected into an animal capable of producing anti- 
bodies. Either monoclonal or polyclonal antibodies can be generated for subsequent 
use in immunoassays to measure the presence and quantity of the polypetide. 

10 Methods of producing polyclonal antibodies are known to those of skill in the art. In 
brief, an immunogen (antigen), preferably a purified polypeptide, a polypeptide 
coupled to an appropriate carrier (e.g., GST, keyhole limpet hemanocyanin, etc.), or 
a polypeptide incorporated into an immunization vector such as a recombinant vac- 
cinia virus (see, U.S. Patent No. 4,722,848) is mixed with an adjuvant and animals 

1 5 are immunized with the mixture. The animal's immune response to the immunogen 
preparation is monitored by taking test bleeds and determining the titer of reactivity 
to the polypeptide of interest. When appropriately high titers of antibody to the im- 
munogen are obtained, blood is collected from the animal and antisera are prepared. 
Further fractionation of the antisera to enrich for antibodies reactive to the polypep- 

20 tide is performed where desired (see, e.g., Coligan (1991) Current Protocols in Im- 
munology Wiley/Greene, NY; and Harlow and Lane (1989) Antibodies: A Laborato- 
ry Manual Cold Spring Harbor Press, NY). 

Antibodies, including binding fragments and single chain recombinant versions the- 
25 reof, against predetermined fragments of the present polypeptides are raised by im- 
munizing animals, e.g., with conjugates of the fragments with carrier proteins as 
described above. Typically, the immunogen of interest is a peptide of at least about 
5 amino acids," more typically the peptide is 10 amino acids in length, preferably, the 
fragment is 15 amino acids in length and more preferably the fragment is 20 amino 
30 acids in length or greater. The peptides are typically coupled to a carrier protein 
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(e.g., as a fusion protein), or are recombinantly expressed in a immunization vector. 
Antigenic determinants on peptides to which antibodies bind are typically 3 to 10 
amino acids in length. 

5 Monoclonal antibodies are prepared from cells secreting the desired antibody. These 
antibodies are screened for binding to normal or modified polypeptides, or screened 
for agonistic or antagonistic activity, e.g., activity mediated through a fused protein. 
Specific monoclonal and polyclonal antibodies will usually bind with a K D of at le- 
ast about 0.1 mM, more usually at least about 50 uM, and most preferably at least 
1 0 about 1 uM or better. 

In some instances, it is desirable to prepare monoclonal antibodies from various 
mammalian hosts, such as mice, rodents, primates, humans, etc. Description of tech- 
niques for preparing such monoclonal antibodies are found in, e.g. Stites et al. (eds) 

1 5 Basic and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, 
CA, and references cited therin; Harlow and Lane, supra; Goding ( 1 986) Monoclo- 
nal Antiboides: Principles and Practice (2d ed.) Academic Press, New York, NY; 
and Kohler and Milstein (1975) Nature 256: 495-497. Summarized briefly, this 
method proceeds by injecting an animal with an immunogen. The animal is then sa- 

20 crificed and cells taken from its spleen, which are fused with myeloma cells. The 
result is a hybrid cell or "hybridoma" that is capable of reproducing in vitro. The 
population of hybridomas is then screened to isolate individual clones, each of 
which secrete a single antibody species to the immunogen. In this manner, the indi- 
vidual antibody species obtained are the products of immortalized and cloned single 

25 B cells from the immune animal generated in response to a specific site recognized 
on the immunogenic substance. 

Alternative methods of immortalization include transformation "with Epstein Barf 
Virus, oncogenes, or retroviruses, or other methods known in the art. Colonies 
30 arising from single immortalized cells are screened for production of antibodies of 
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the desired specificity and affinity for the antigen, and yield of the monoclonal anti- 
bodies produced by such cells in enhanced by various techniques, including injec- 
tion into the peritoneal cavity of a vertebrate (preferably mammalian) host. The po- 
lypetides and antibodies of the present invention are used with or without modifica- 
5 tion, and include chimeric antibodies such as humanized murine antibodies. 

Other suitable techniques involve selection of libraries of recombinant antibodies in 
phage or similar vectors (see, e.g., Huse etal (1989) Science 246: 1275-1281; and 
Ward, et al. (1989) Nature 341 : 544-546; and Vaughan et al (1996) Nature Bio- 
10 technology, 14: 309-314). 

Frequently, the polypetides and antibodies will be labeled by joining, either cova- 
lently or non-covalently, a substance which provides for a detectable signal. A wide 
variety of labels, and conjugation techniques are known and are reported extensively 
1 5 in both the scientific and patent literature. Suitable labels include radionuclides, 
enzymes, substrates, cofactors, inhibitors, fluorescent moieties, chemiluminescent 
moieties, magnetic particles, and the like. Patents teaching the use of such labels 
include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. Also recombinant immunoglobulins may be produced. 
20 See, Cabalitty, U.S. Patent No. 4,816,567; and Queen et al (1989) Proc. Nat'l Acad. 
Sci. USA 86: 10029-10033. 

The antibodies of this invention are also used for affinity chromatography in isola- 
ting fused polypeptides. Columns are prepared, e.g., with the antibodies linked to a 
25 solid support, e.g., particles, such as agarose, Sephadex, or the like, where a cell ly- 
sate is passed through the column, washed, and treated with increasing concentra- 
tions of a mild denaturant, whereby purified fused polypeptides are released. 

The antibodies can be used to screen expression libraries for particular expression 
30 products such as normal or abnormal human fused protein. Usually the antibodies in 
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such a procedure are labeled with a moiety allowing easy detection of presence of 
antigen by antibody binding. 

Antibodies raised against fused polypeptides can also be used to raise antiidiotype 
5 antibodies. These are useful for detecting or diagnosing various pathological condi- 
tions related to the presence of the respective antigens. 

B) Human or humanized (chimeric) antibody production. 

The antibodies of this invention can also be administered to an organism (e.g., a 

10 human patient) for therapeutic purposes. Antibodies administered to an organism 
other than the species in which they are raised are often immunogenic. Thus, for ex- 
ample, murine antibodies administered to a human often induce an immunologic 
response against the antibody {e.g., the human anti-mouse antibody (HAMA) res- 
ponse) on multiple administrations. The immunogenic properties of the antibody are 

1 5 reduced by altering portions, or all, of the antibody into characteristically human 
sequences thereby producing chimeric or human antibodies, respectively. 

i) Humanized (chimeric) antibodies. 

Humanized (chimeric) antibodies are immunoglobulin molecules comprising a hu- 
20 man and non-human portion. More specifically, the antigen combining region (or 
variable region) of a humanized chimeric antibody is derived from a non-human 
source {e.g., murine) and the constant region of the chimeric antibody (which con- 
fers biological effector function to the immunoglobulin) is derived from a human 
source. The humanized chimeric antibody should have the antigen binding specifi- 
25 city of the non-human antibody molecule and the effector function conferred by the 
human antibody molecule. A large numer of methods of generating chimeric antibo- 
dies are well known to those of skill in the art {see, e.g., U.S. Patent Nos: 5,502,167, 
' 5,500,362, 47491,088, 5,482,856,-5,472,693; 5,354,847; 5,292,867, 5,23 1,026; 
5,204,244,5,202,238,5,169,939,5,081,235, 5,075,431, and 4,975,369). 



30 
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In general, the procedures used to produce these chimeric antibodies consist of the 
following steps (the order of some steps may be interchanged): (a) identifying and 
cloning the correct gene segment encoding the antigen binding portion of the anti- 
body molecule; this gene segment (known as the DVJ, variable, diversity and joi- 

5 ning regions for heavy chains or VJ, variable, joining regions, for light chains (or 
simply as the V or Variable region) may be in either the cDNA or genomic form; (b) 
cloning the gene segments encoding the constant region or desired part thereof; (c) 
ligating the variable region to the constant region so that the complete chimeric an- 
tibody is encoded in a transcribable and translatable form; (d) ligating this construct 

1 0 into a vector containing a selectable marker and gene control regions such as pro- 
moters, enhancers and poly(A) addition signals; (e) amplifying this construct in a 
host cell (e.g., bacteria; (f) introducing the DNA into eukaryotic cells 
(transfections) most often mammalian lymphocytes; and culturing the host cell un- 
der conditions suitable for expression of the chimeric antibody. 

15 

Antibodies of several distinct antigen bindning specificities have been manipulated 
by these protocols to produce chimeric proteins (e.g., anti-TNP: Boulianne et al 
(1984) Nature, 312: 643; and anti-tumor antigens: Sahagan et al. (1986) J. Immu- 
nol, 137: 1066). Likewise several different effector functions have been achieved 
20 by linking new sequences to those encoding the antigen binding region. Some of 
these inlcude enzymes (Neuberger et al. (1984) Nature 312:604), immunoglobulin 
constant regions from another species and constant regions of another immunoglo- 
bulin chain (Sharon et al. (1984) Nature 309: 364; Tan et al, (1985) J. Immunol. 
135: 3565-3567). 

25 

In one preferred embodiment, a recombinant DNA vector is used to transfect a cell 
line that produces an anti-fused antibody. The novel recombinant DNA vector con- 
tains a "replacement gene" to replace all or a portion of the gene encoding the im- 
munoglobulin constant region in the cell line (e.g., a replacement gene may encode 
30 all or a portion of a constant region of a human immunoglobulin, a specific immu- 
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noglobulin class, or an enzyme, a toxin, a biologically active peptide, a growth fac- 
tor, inhibitor, or a linker peptide to facilitate conjugation to a drug, toxin, or other 
molecule, etc.), and a "target sequence" which allows for targeted homologous 
recombination with immunoglobulin sequences within the antibody producing cell. 

5 

In another embodiment, a recombinant DNA vector is used to transfect a cell line 
that produces an antibody having a desired effector function, (e.g., a constant region 
of a human immunoglobulin) in which case, the replacement gene contained in the 
recombinant vector may encode all or a portion of a region of an anti-fused antibody 

1 0 and the target sequence contained in the recombinant vector allows for homologous 
recombination and targeted gene modification within the antibody producing cell. In 
either embodiment, when only a portion of the variable or constant region is repla- 
ced, the resulting chimeric antibody may define the same antigen and/or have the 
same effector function yet be altered or improved so that the chimeric antibody may 

15 demonstrate a greater antigen specificity, greater affinity binding constant, increased 
effector function, or increased secretion and production by the transfected antibody 
producing cell line, etc. 

Regardless of the embodiment practiced, the processes of selection for integrated 
20 DNA (via a selectable marker), screening for chimeric antibody production, and cell 
cloning, can be used to obtain a clone of cells producing the chimeric antibody. 

Thus, a piece of DNA which encodes a modification for a monoclonal antibody can 
be targeted directly to the site of the expressed immunoglobulin gene within a B-cell 

25 or hybridoma cell line. DNA constructs for any particular modification may be used 
to alter the protein product of any monoclonal cell line or hybridoma. Such a proce- 
dure circumvents the costly and time consuming task of cloning both heavy and 
light chain variable region genes from each B-cell close expressing a useful antigen 
specificity. In addition to circumventing the process of cloning variable region ge- 

30 nes. the level of expression of chimeric antibody should be higher when the gene is 
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at its natural chromosomal location rather than at a random position. Detailed met- 
hods for preparation of chimeric (humanized) antibodies can be found in U.S. Patent 
5,482,856. 

5 ii) Human antibodies. 

In another embodiment, this invention provides for fully human anti-fused antibodi- 
es. Human antibodies consist entirely of characteristically human polypeptide 
sequences. The human antibodies of this invention can be produced in using a wide 
variety of methods (see, e.g., Larrick et al, U.S. Pat. No 5,001,065, for review). 
1 0 In one preferred embodiment, the human antibodies of the present invention are 
usually produced initially in trioma cells. Genes encoding the antibodies are then 
cloned and expressed in other cells, particularly, nonhuman mammalian cells. 

The general approach for producing human antibodies by trioma technology has be- 
15 en described by Ostberg et al. (1983) Hybridoma 2: 361-367, Ostberg, U.S. Pat. No. 
4,634,664, and Engelman et al. U.S. Pat. No. 4,634,666. The antibody-producing 
cell lines obtained by this method are called triomas because they are descended 
from three cells; two human and one mouse. Triomas have been found to produce 
antibody more stably than ordinary hybridomas made from human cells. 

20 

Preparation of trioma cells requires an initial fusion of a mouse myeloma cell line 
with unimmunized human peripheral B lymphocytes. This fusion generates a xeno- 
genic hybrid cell containing both human and mouse chromosomes (see, Engelman, 
supra.). Xenogenic cells that have lost the capacity to secrete antibodies are selec- 
25 ted. Preferably, a xenogenic cell is selected that is resistant to 8-azaguanine. Cells 
possessing resistance to 8-azaguanine are unable to propagate on hypoxanthine- 
aminopterin-thymidine (HAT) or azaserine-hypoxanthine (AH) media. 

The capacity to secrete antibodies is conferred by a further fusion between the xe- 
30 nogenic cell and B-lymphocytes immunized against a polypeptide according to the 
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invention or an epitope thereof. The B -lymphocytes are obtained from the spleen, 
blood or lymph nodes of human donor. If antibodies against a specific antigen or 
epitope are desired, it is preferable to use that antigen or epitope thereof as the im- 
munogen rather than fused polypeptide. Alternatively, B-lymphocytes are obtained 
5 from an unimmunized individual and stimulated with the present polypeptide, or an 
epitope thereof, in vitro. In a further variation, B-lymphocytes are obtained from an 
infected, or otherwise immunized individual, and then hyperimmunized by exposure 
to a fused polypeptide for about seven to fourteen days, in vitro. 
The immunized B-lymphocytes prepared by one of the above procedures are fused 

1 0 with a xenogenic hybrid cell by well known methods. For example, the cells are tre- 
ated with 40-50% polyethylene glycol of MW 1000-4000, at about 37°C for about 
5-10 min. Cells are separated from the fusion mixture and propagated in media se- 
lective for the desired hybrids. When the xenogenic hybrid cell is resistant to 8- 
azaguanine, immortalized trioma cells are conveniently selected by successive pas- 

1 5 sage of cells on HAT or AH medium. Other selective procedures are, of course, pos- 
sible depending on the nature of the cells used in fusion. Clones secreting antibodies 
having the required binding specificity are identified by assaying the trioma culture 
medium for the ability to bind to the present polypeptide or an epitope thereof. Tri- 
omas producing human antibodies having the desired specificity are subcloned by 

20 the limiting dilution technique and grown in vitro in culture medium, or are injected 
into selected host animals and grown in vitro. 

The trioma cell lines obtained are then tested for the ability to bind a polypeptide or 
an epitope thereof. Antibodies are separated from the resulting culture medium or 
25 body fluids by conventional antibody-fractionation procedures, such as ammonium 
sulfate precipitation, DEAE cellulose chromatography and affinity chromatography. 

Although triomas are genetically stable they do hot produce atitibodiesat very high 
levels. Expression levels can be increased by cloning antibodies genes from the tri- 
30 oma into one or more expression vectors, and transforming the vector into a cell 
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line, such as the cell lines typically used for expression of recombinant or huma- 
nized immunoglobulins. As well as increasing yield of antibody, this strategy offers 
the additional advantage that immunoglobulins are obtained from a cell line that do- 
es not have a human component, and does not therefore need to be subjected to the 
5 especially extensive viral screening required for human cell lines. 

The genes encoding the heavy and light chains of immunoglobulins secreted by tri- 
oma cell lines are cloned according to methods, including the polymerase chain re- 
action, known in the art {see, e.g., Sambrook et al, Molecular Cloning: A Labor ato- 

1 0 ry Manual, 2nd ed., Cold Spring Harbor, N.Y., 1 989; Berger & Kimmel, Methods in 
Enzymology, Vol. 152: Guide to Molecular Cloning Techniques, Academic Press, 
Inc., San Diego, Calif., 1987; Co et al. (1992) J. Immunol, 148: 1 149). For examp- 
le, genes encoding heavy and light chains are cloned from a trioma's genomic DNA 
or cDNA produced by reverse transcription of the trioma's RNA. Cloning is ac- 

1 5 complished by conventional techniques including the use of PCR primers that hyb- 
ridize to the sequences flanking or overlapping the genes, or segments of genes, to 
be cloned. 



Typically, recombinant constructs comprise DNA segments encoding a complete 
20 human immunoglobulin heavy chain and/or a complete human immunoglobulin 
light chain of an immunoglobulin expressed by a trioma cell line. Alternatively, 
DNA segments encoding only a portion of the primary antibody genes are produced, 
which portions possess binding and/or effector activities. Other recombinant 
constructs contain segments of trioma cell line immunoglobulin genes fused to seg- 
25 ments of other immunoglobulin genes, particularly segments of other human 
constant region requences (heavy and/or light chain). Human constant region 
sequences can be selected from various reference sources, including but not limited 
to those listed in Kabat et al: (1 987) Sequences of Proteins of Immunological Inte- 
rest, U.S. Department of Health and Human Services. 

30 
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In addition to the DNA segments encoding an\\-fused immunoglobulins or frag- 
ments therof, other substantially homologous modified immunoglobulins can be re- 
adily designed and manufactured utilizing various recombinant DNA techniques 
known to those skilled in the art such as site-directed mutagenesis (see Gillman & 

5 Smith (1979) Gene, 8: 81-97; Roberts et ah (1987) Nature 328: 731-734). Such mo- 
dified segments will usually retain antigen binding capacity and/or effector function. 
Moreover, the modified segments are usually not so far changed from the original 
trioma genomic sequences to prevent hybridization to these sequences under strin- 
gent conditions. Because, like many genes, immunoglobulin genes contain separate 

1 0 functional regions, each having one or more distinct biological activites, the genes 
may be fused to functional regions from other genes to produce fusion proteins (e.g., 
immunotoxins) having novel properties or novel combinations of properties. 

The recombinant polynucleotide constructs will typically include an expression 
1 5 control sequence operably linked to the coding sequences, including naturally asso- 
ciated or heterologous promoter regions. Preferably, the expression control sequen- 
ces will be eukaryotic promoter systems in vectors capable of transforming or 
transfecting eukaryotic host cells. Once the vector has been incorporated into the 
appropriate host, the host is maintained under conditions suitable for high level ex- 
20 pression of the nucleotide sequences, and the collection and purification of the hu- 
man mti-fused immunoglobulins. 

These expression vectors are typically replicable in the host organisms either as epi- 
somes or as an integral part of the host chromosomal DNA.. Commonly, expression 
25 vectors will contain selection markers, e.g. ampicillin-resistance or hygromycin- 
resistance, to permit detection of those cells transformed with the desired DNA 
sequences. 

In general, prokaryotes can be used for cloning the DNA sequences encoding a hu- 
30 man anli-fused immunoglobulin chain. E. coli is one prokaryotic host particularly 
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useful for cloning the DNA sequences of the present invention. Microbes, such as 
yeast are also useful for expression. Saccharomyces is a preferred yeast host, with 
suitable vectors having expression control sequences, an origin of replication, ter- 
mination sequences and the like as desired. Typical promoters include 3- 
5 phosphoglycerate kinase and other glycolytic en2ymes. Inducible yeast promoters 
include, among others, promoters from alcohol dehydrogenase 2, isocytochrome C, 
and enzymes responsible for maltose and galactose utilization. 

Mammalian cells are a particularly preferred host for expressing nucleotide seg- 
10 ments encoding immunoglobulins or fragments thereof {see, e.g. Winnacker, From 
Genes to Clones, VCH Publishers, N.Y., 1987). A number of suitable host cell lines 
capable of secreting intact heterologous proteins have been developed in the art, and 
include CHO cell lines, various COS cell lines, HeLa cells, L cells and myeloma 
cell lines. Preferably, the cells are nonhuman. Expression vectors for these cells can 
1 5 include expression control sequences, such as an origin of replication , a promoter, 
an enhancer (Queen et a!. (1986) Immunol. Rev. 89:49), and necessary processing 
information sites, such as ribosome binding sites, RNA splice sites, polyadenylation 
sites, and transcriptional terminator sequences. Preferred expression control sequen- 
ces are promoters derived from endogenous genes, cytomegalovirus, SV40, adeno- 
20 virus, bovine pappillomavirus, and the like (see, e.g., Co et al. (1992) J. Immunol. 
1458: 1149). 

The vectors containing the DNA segments of interest can be transferred into the host 
cell by well-known methods, depending on the type of cellular host. For example, 
25 calcium chloride transfection is commonly utilized for prokaryotic cells, whereas 
calcium phosphate treatment, electroporation, lipofection, biolistics or viral-based 
transfection may be used for other cellular hosts. Other methods used to transform 
mammalian cells include the use of polybrene, protoplast fusion, liposomes, electro- 
poration, and microinjection (see, generally, Sambrook et al, supra). 
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Once expressed, human anti-fused immunoglobulins of the invention can be purified 
according to standard procedures of the art, including HPLC purification, fraction 
column chromatography, gel electrophoresis and the like (see, generally, Scopes, 
Protein Purification, Springer-Verlag, NY, 1982). Detailed protocols for the pro- 
duction of human antibodies can be found in U.S. Patent 5,506,132. 

Other approaches in vitro immunization of human blood. In this approach, human 
blood lymphocytes capable of producing human antibodies are produced. Human 
peripheral blood is collected from the patient and is treated to recover mononuclear 
cells. The suppressor T-cells then are removed and remaining cells are suspended in 
a tissue culture medium to which is added the antigen and autologous serum and, 
preferably, a nonspecific lymphocyte activator. The cells then are incubated for a 
period of time so that they produce the specific antibody desired. The cells then can 
be fused to human myeloma cells to immortalize the cell line, thereby to permit 
continuous production of antibody (see U.S. Patent 4,716,1 1 1). 

In another approach, mouse-human hybridomas which produces human anti-fused 
are prepared (see, e.g. 5,506,132). Other approaches include immunization of muri- 
nes transformed to express human immunoglobulin genes, and phage display scre- 
ening (V aughan et al. Supra). 

IV. Expression of fused polypeptides. 
A) De novo chemical synthesis. 

The present proteins or subsequences thereof may be synthesized using standard 
chemical peptide synthesis techniques. Where the desired subsequnces are relatively 
short (e.g., when a particular antigenic determinant is desired) the molecule may be 
synthesized as a single contiguous polypeptide. Where larger molecules are desired, 
subsequences can be synthesized separately (in one or more units) and then fused by 
condensation of the amino terminus of one molecule with the carboxyl terminus of 
the other molecule thereby forming a peptide bond. 
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Solid phase synthesis in wich the C-terminal amino acid of the sequence is attached 
to an insoluble support followed by sequential addition of the remaining amino acids 
in the sequence is the preferred method for the chemical synthesis of the polypepti- 
5 des of this invention. Techniques for solid phase synthesis are described by Barany 
and Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, 
Synthesis, Biology. Vol. w: Special Methods in peptide Synthesis, Part A., Merrifi- 
led, et al. J. Am. Chem. Soc, 85: 2149-2156 (1963), and Stewart et al, Solid Phase 
Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 1 1 1 .( 1 984). 

10 B. Recombinant expression. 

In a preferred embodiment, the present proteins or subsequences thereof are synthe- 
sized using recombinant DNA methodology. Generally this involves creating a 
DNA sequence that encodes the fusion protein, placing the DNA in an expression 
casette under the control of a particular promoter, expressing the protein in a host, 

1 5 isolating the expressed protein and, if required, renaturing the protein. 

DNA encoding the present proteins or subsequences of this invention may be prepa- 
red by any suitable method as described above, including, for example, cloning and 
restriction of appropriate sequences or direct chemical synthesis by methods such as 
20 the phosphotriester method of Narang et al. Meth. Enzymol. 68: 90-99 (1979); the 
phosphodiester method of Brown et al, Meth Enzymol. 68: 109-151 (1979); the di- 
ethylphosphoramidite method of Beaucage et al, Tetra . Lett., 22: 1859-1862 
(1981); and the solid support method of U.S. Patent No. 4,458,066. 

25 Chemical synthesis produces a single stranded oligonucleotide. This may be con- 
verted into double stranded DNA by hybridization with a complementary sequence, 
or by polymerization with a DNA polymerase using the single strand as a template. 
"One of skill would recognize that while chemical synthesis of DNA is limited to 
sequences of about 1 00 bases, longer sequences may be obtained by the ligation of 

30 shorter sequences. 
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Alternatively, subsequences may be cloned and the appropriate subsequences clea- 
ved using appropriate restriction enzymes. The fragments may then be ligated to 
produce the desired DNA sequence. 

5 

In one embodiment, proteins of this invention may be cloned using DNA amplifica- 
tion methods such as polymerase chain reaction (PCR). Thus, for example, the nuc- 
leic acid sequence or subsequence is PCR amplified, using a sense primer contai- 
ning one restriction site (e.g., Ndel) and an antisense primer containing another re- 

10 striction site (e.g., HindlTI). This will produce a nucleic acid encoding the desired 
sequence or subsequence and having terminal restriction sites. This nucleic acid can 
then be easily ligated into a vector containing a nucleic acid encoding the second 
molecule and having the appropriate corresponding restriction sites. Suitable PCR 
primers can be determined by one of skill in the art using the Sequence information 

1 5 provided in SEQ ED No: 1 . Appropriate restriction sites can also be added to the 

nucleic acid encoding the protein or protein subsequence by site-directed mutagene- 
sis. The plasmid containing the sequence or subsequence is cleaved with the appro- 
priate restriction endonuclease and then ligated into the vector encoding the second 
molecule according to standard methods. 

20 

The nucleic acid sequences encoding fused proteins or protein subsequences may 
be expressed in a variety of host cells, including E. coli, other bacterial hosts, yeast, 
and various higher eukaryotic cells such as the COS, CHO and HeLa cells lines and 
myeloma cell lines. As the present proteins are typically found in eukaryotes, a eu- 
25 karyote host is preferred. The recombinant protein gene will be operably linked to 
appropriate expression control sequences for each host. For E. coli this includes a 
promoter such as the T7, trp, or lambda promoters, a ribosome binding site and 
preferably a transcription termination signal. For eukaryotic cells, the control 
sequences will include a promoter and preferably and enhancer derived from immu- 
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noglobulin genes, SV40, cytomegalovirus, etc., and a polyadenylation sequence, and 
may include splice donor and acceptor sequences. 

The plasmids of the invention can be transferred into the chosen host cell by well- 
5 known methods such as calcium chloride transformation for E. coli and calcium 
phosphate treatment or electroporation for mammalian cells. Cells transformed by 
the plasmids can be selected by resistance to antibiotics conferred by genes contai- 
ned on the plasmids, such as the amp, gpt, neo and hyg genes. 

10 Once expressed, the recombinant proteins according to the invention can be purified 
according to standard procedures of the art, including ammonium sulfate precipita- 
tion, affinity columns, column chromatography, gel electrophoresis and the like 
(see, generally, R. Scopes, Protein Purification, springer-Verlag, N.Y. (1982), 
Deutscher, Methods in Enzymology Vol. 183: Guide to Protein Purification., Aca- 

15 demic Press, Inc. N.Y. (1990)). Substantially pure compositions of at least about 90 
to 95% homogeneity are preferred, and 98 to 99% or more homogeneity are most 
preferred. Once purified, partially or to homogeneity as desired, the polypeptides 
may then be used {e.g., immunogens for antibody production). 

20 One of skill in the art would recognize that after chemical synthesis, biological ex- 
pression, or purification, the present protein(s) may possess a conformation substan- 
tially different than the native conformations of the constituent polypeptides. In this 
case, it may be necessary to denature and reduce the polypeptide and then to cause 
the polypeptide to re-fold into the preferred conformation. Methods or reducing and 

25 denaturing proteins and inducing re-folding are well known to those of skill in the 
art (See, Debinski et al. (1993) J. Biol. Chem., 268: 14065-14070; Kreitman and 
pastan (1993) Bioconjug. Chem., 4: 581-585; and Buchner, et al, (1992) Anal. Bi- 
ochem., 205: 263-270). Debinski et al., for example,- describe the denaturation and 
reduction of inclusion body proteins in guanidine-DTE. The protein is then refolded 

30 in a redox buffer containing oxidized glutathione and L-arginine. 
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One of skill would recognize that modifications can be made to the fused proteins 
without diminishing their biological activity. Some modifications may be made to 
facilitate the cloning, expression, or incorporation of the targeting molecule into a 
fusion protein. Such modifications are well known to those of skill in the art and 
include, for example, a methionine added at the amino terminus to provide an initia- 
tion site, or additional amino acids {e.g., poly His) placed on either terminus to cre- 
ate conveniently located restriction sites or termination codons or purification 
sequences. 

V. Detection of fused 

As indicated above, abnormal {e.g., altered or deficient) expression of the human 
fused gene is beleived to be a causal factor in the development of several different 
conditions, such as cancers. 

Thus, it is desirable to determine the presence or absence, or quantify, the expres- 
sion of fused polypeptides of the nucleic acids encoding the fused polypeptides. This 
may be accomplished by assaying the gene product; fused polypeptides themselves, 
or alternatively, by assaying the nucleic acids (DNA or mRNA) that encode the fu- 
sed polypeptides. In particular, it is desirable to determine whether fused expression 
is present, absent, or abnormal {e.g., because of an abnormal gene product or becau- 
se of abnormal expression levels as, for example, with a hemizygous gene). Parti- 
cularly, where it is desired to determine a heritable propensity for abnormal fused 
gene expression, it is preferred to assay the host DNA for abnormal fused genes or 
gene transcripts (mRNAs). 

A) Sample Collection and Processing 

The fused gene or gene product {i.e., mRNA or polypeptide) is preferably detected 
and/or quantified in a biological sample. As used herein, a biological sample is a 
sample of biological tissue or fluid that, in a healthy and/or pathological state, con- 
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tains a fused nucleic acid or polypeptide. Such samples include, but are not limited 
to, sputum, amniotic fluid, blood, blood cells (e.g., white cells), tissue or fine needle 
biopsy samples, urine, peritoneal fluid, and pleural fluid, or cells therefrom. Biolo- 
gical samples may also include sections of tissues such as frozen sections taken for 
5 histological purposes. Although the sample is typically taken from a human patient, 
the assays can be used to detect fused genes or gene products in samples from any 
mammal, such as dogs, cats, sheep, cattle, and pigs. 

The sample may be pretreated as necessary by dilution in a appropriate buffer solu- 
1 0 tion or concentrated, if desired. Any of a number of standard aqueous buffer solu- 
tions, employing one of a variety of buffers, such as phosphate, Tris, or the like, at 
physiological pH can be used. 

B) Nucleic acid assays. 

1 5 In one embodiment, this invention provides for methods of detecting and/or quanti- 
fying human fused expression by assaying the xmderly'mgfused gene (or a fragment 
thereof) or by assaying the fused gene transcrip (mRNA). The assay can be for the 
presence or absence of the normal gene or gene product, for the presence or absence 
of an abnormal gene or gene product, or quantification of the transcription levels of 

20 normal or abnormal fused gene product. 

0 Nucleic acid sample. 

In a preferred embodiment, nucleic acid assays are performed with a sample of 
nucleic acid isolated from the organism to be tested. In the simplest embodiment, 
25 such a nucleic acid sample is the total mRNA isolated from a biological sample. The 
nucleic acid (e.g., either genomic DNA or mRNA) may be isolated from the sample 
according to any of a number of methods well known to those of skill in the art. 
One of skill will appreciate that where alterations in the copy number of the fused 
gene are to be detected genomic DNA is preferably isolated. Conversely, where ex- 
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pression levels of a gene or genes are to be detected, preferably RNA (mRNA) is 
isolated. 

Methods of isolating total DNA or mRNA are well known to those of skill in the art. 
5 For example, methods of isolation and purificaytion of nucleic acids are described in 
detail in Chapter 3 of Laboratory Techniques in Biochemistry and Molecular Biolo- 
gy: Hybridization With Nucleic Acid Probes, part 1. Theory and Nucleic Acid Pre- 
paration, P. Tijssen, ed. elsevier, N.Y. (1993) and Chapter 3 of Laboratory Tech- 
niques in Biochemistry and Molecular Biology: Gybridization With Nucleic Acid 
1 0 Probes, Part I. Theory and Nucleic Acid Preparation. P. Tijssen ed. Elsevier, N.Y. 
(1993)). 

In a preferred embodiment, the total nucleic acid is isolated from a given sample 
using, for example, an acid guanidinium-phenol-chloroform extraction method and 
poly A + mRNA is isolated by oligo dT column chromatography or by using (dT)n 
1 5 magnetic beads {see, e.g., Sambrook et ah, Molecular Cloning: A Laboratory Ma- 
nual 2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989), or Current Pro- 
tocols in Molecular Biology, F. Ausubel et ah, ed. Greene Publishing and Wiley- 
Interscience, New York (1987)). 

20 Frequently, it is desirable to amplify the nucleic acid sample prior to hybridization. 
One of skill in the art will appreciate that whatever amplification method is used, if 
a quantitative result is desired, care must be taken to use a method that maintains or 
controls for the relative frequencies of the amplified nucleic acids. 

25 Methods of "quantitative" amplification are well known to those of skill in the art. 
For example, quantitative PCR involves simultaneously co-amplifying a known qu- 
antity of a control sequence using the same primers. This provides an internal stan- 
dard that may be used to calibrate the PCR reaction. The high density array may 
then include probes specific to the internal standard for quantification of the ampli- 

30 fied nucleic acid. 
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One preferred internal standard is a synthetic AW 106 cRNA. The AW106 cRNA is 
combined with RNA isolated from the sample according to standard techniques 
known to those of skill in the art. The RNA is then reverse transcribed using a rever- 
5 se transcriptase to provide copy DNA. The cDNA sequences are then amplified 

(e.g., by PCR) using labeled primers. The amplification products are separated, typi- 
cally by electrophoresis, and the amount of radioactivity (proportional to the amount 
of amplified product) is determined. The amount of mRNA in the sample is then 
calculated by comparison with the signal produced by the known AW 106 RNA 

10 standard. Detailed protocols for quantitative PCR are provided in PCR Protocols, A 
guide to Methods and Applications, Innis etal, Academic Press, Inc. N.Y., (1990). 
Other suitable amplification methods include, but are not limited to polymerase 
chain reaction (PCR) (Innis et al, (1990) PCR Protocols. A guide to Methods and 
Application. Academic Press, Inc. San Diego), ligase chain reaction (LCR) (see Wu 

15 and Wallace (1989) Genomics 4: 560, Landegren et al. (1988) Science 241: 1077, 
and Barringer et al. (1990) Gene 89: 117, transcription amplification (Kwoh et al. 
(1989) Proc. Natl. Acad. Sci. USA 86: 1173), and self-sustained sequence replica- 
tion (Guatelli etal. (1990) Proc. Nat. Acad. Sci. USA 87: 1874). 

20 n) Hybridization assays. 

A variety of methods for specific DNA and RNA measurement using nucleic acid 
hybridization techniques are known to those of skill in the art (see Sambrook et al. 
Supra). For example, one method for evaluating the presence, absence, or quantity 
of DNA encoding./wsed proteins in a sample involves a Southern transfer. Briefly, 

25 the digested genomic DNA is run on agarose slab gels in buffer and transferred to 
membranes. 

Hybridization is carried our using the nucleic acid probes specific for the target fu- 
sed sequence or subsequence. Nucleic acid probes are designed based on the nucleic 
30 acid sequences encoding fused proteins (see SEQ ID NO: 2). The probes can be full 
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length of the nucleic acid sequence encoding the fused protein. Shorter probes are 
empirically tested for specificity. Preferably nucleic acid probes are 20 bases or 
longer in length. (See Sambrook et al. For methods of selecting nucleic acid probe 
sequences for use in nucleic acid hybridization.) Visualization of the hybridized 
5 portions allows the qualitative determination of the presence or absence of DNA en- 
coding fused proteins. 

Similarly, a Northern transfer may be used for the detection of mRNA encoding/w- 
sed proteins. In brief, the mRNA is isolated from a given cell sample using, for ex- 
1 0 ample, and acid guanidinium-phenol-chloroform extraction method. The mRNA is 
then electrophoresed to separate the mRNA species and the mRNA is transferred 
from the gel to a nitrocellulose membrane. As with the Southern blots, labeled pro- 
bes are used to identify the presence or absence of fused proteins. 

1 5 A variety of nucleic acid hybridization formats are known to those skilled in the art. 
For example, common formats include sandwich assays and competition or displa- 
cement assays. Hybridization techniques are generally described in ""Nucleic Acid 
Hybridization, A Practical Approach," Ed. Hames, B.D. and Higgins, S.J., IRL 
Press, (1985); Gall and Pardue Proc. Natl. Acad. Sci. U.S.A. 63: 378-383 (1969); 

20 and John et al. Nature 223: 582-587 ( 1 969). 

For example, sandwich assays are commercially useful hybridization assays for de- 
tecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic 
acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in 
25 solution. The clinical sample will provide the target nucleic acid. The "capture" 

nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid to 
form a "sandwich" hybridization complex. To be effective, the signal nucleic acid 
cannot hybridize with the capture nucleic acid. - -- -- - 
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Typically, labeled signal nucleic acids are used to detect hybridization. Comple- 
mentary nucleic acids or signal nucleic acids may be labeled by any one of several 
methods typically used to detect the presence of hybridized polynucleotides. The 
most common method of detection is the use of autoradiography with 3 H, 125 1, 35 S, 
5 14 C, or 32 P-labelled probes or the like. Other labels include ligands which bind to la- 
beled antibodies, fluorophores, chemiluminescent agents, enzymes, and antibodies 
which can serve as specific binding pair members for a labeled ligand. 

Detection of a hybridization complex may require the binding of a signal generating 
10 complex to a duplex of target and probe polynucleotides or nucleic acids. Typically, 
such binding occurs through ligand and anti-ligand interactions as between a ligand- 
conjugated probe and an anti-ligand conjugated with a signal. 
The label may also allow indirect detection of the hybridization complex. For ex- 
ample, where the label is a hapten or antigen, the sample can be detected by using 
1 5 antibodies. In these systems, a signal is generated by attaching fluorescent or enzy- 
me molecules to the antibodies or, in some cases, by attachment to a radioactive la- 
bel. (Tijssen, P., "Practice and Theory of Enzyme Immunoassays," Laboratory 
Techniques in Ciochemistry and Molecular Biology, Burdon, R.H., van Knippen- 
berg, P.H., Eds., Elsevier (1985), pp. 9-20.). 

20 

The sensitivity of the hybridization assays may be enhanced through use of a nuc- 
leic acid amplification system which multiplies the target nucleic acid being detec- 
ted. Examples of such systems include the polymerase chain reaction (PCR) system 
and the ligase chain reaction (LCR) system. Other methods recently described in the 
25 art are the nucleic acid sequence based amplification (NASBA™, Cangene, Missis- 
sauga, Ontario) and Q Beta Replicase systems. 

An alternative means" for determining the level of expression of a gene encoding a 
fused protein is in situ hybridization. In situ hybridization assays are well known 
30 and are generally described in Angerer, et ah, Methods Enzymol, 1 52: 649-660 
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(1987). In an in situ hybridization assay, cells or tissue specimens are fixed to a so- 
lid support, typically a glass slide. If DNA is to be probed, the cells are denatured 
with heat or alkali. The cells are then contacted with a hybridization solution at a 
moderate temperature to permit annealing of labeled probes specific to fused prote- 
5 ins. The probes are preferably labeled with radioisotopes of fluorescent reporters. 

iii) Amplification based assays. 

In another embodiment, the fused gene or gene product can be detected (assayed) 
using an amplification based assay. In an amplification based assay, all or part of the 

1 0 fused gene or transcript {e.g., mRNA or cDNA) is amplified and the amplification 
product is then detected. Where there is no underlying gene or gene product to act as 
a template amplification is non-specific or non-existent and there is no single amp- 
lification product. Where the underlying gene or gene product is present, the target 
sequence is amplified providing an indication of the presence, absence, or quantity 

1 5 of the underlying gene or mRNA. 

Amplification-based assays are well known to those of skill in the art {see, e.g., In- 
nis, supra.). The cDNA sequence provided for the fused gene is sufficient to enable 
one of skill to routinely select primers to amplify any portion of the gene. 

20 

Amplification primers can be selected to provide amplification products that span 
specific deletions, truncations, and insertions, thereby facilitating the detection of 
specific abnormalities. 

25 iv) Specific detection of abnormalities (e.g., mutations). 

Abnormal fused genes or gene products are characterized by premature stop codons, 
deletions, insertions or change of particular amino acids. Premature stop codons and 
deletions can be detected by decreased size of tfiegene~6r gene product (mRNA 
transcript or cDNA). Similarly, insertions can be detected by increased size of the 
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gene or gene product. Alternatively, mutations can be determined by sequencing of 
the gene or gene product according to standard methods. 

In addition, amplification assays and hybridization probes can be selected to specifi- 

5 cally target particular abnormalities. For example, where the abnormality is a dele- 
tion, nucleic acid probes or amplification primers can be selected that specifically 
hybridize to or amplify, respectively, the nucleic acid sequence that is cdeleted in 
the abnormal gene. The probe will fail to hybridize, or the amplification reaction- 
will fail to provide specific amplification, to abnormal versions of the fused nucleic 

1 0 acids which have the deletion. Alternatively, the probe or amplification reaction can 
be designed to span the entire deletion or either end of the deletion (deletion 
junction). Similarly, probes and amplification primers can be selected that specifi- 
cally target point mutations or insertions. Methods for detecting specific mutations 
were described in, for example, US Patent No. 5,512,441. In the case of PCR, amp- 

1 5 lification primers can be designed to hybridize to a portion of the fused gene but the 
terminal nucleotide at the 3' end of the primer can be used to discriminate between 
the mutant and wild-type forms of fused gene. If the terminal base matches the point 
mutation or the wild-type sequence, polymerase dependent extension can proceed 
and an amplification product is detected. This method for detecting point mutations 

20 or polymorphisms was described in detail by Sommer et ah, (1989) Mayo Clin. 
Proc. 64:1361-1372. By using appropriate controls, one can develop a kit having 
both positive and negative amplification products. The products can be detected 
using specific probes or by simply detecting their presence or absence. A variation 
of the PCR method uses LCR where the point of discrimination, i.e., either the point 

25 mutation or the wild-type bases fall between the LCR oligonucleotides. The ligation 
of the oligonucleotides becomes the means for discriminating between the mutant 
and wild-type forms of the fused gene. 

A variety of automated solid-phase detection techniques are also appropriate for 
30 detecting the presence or absence of mutations in the fused gene. For instance, very 
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large scale immobilized polymer arrays (VLSIPS ), available from Asymetrix, 
Inc. In Santa Clara, CA are used for the detection of nucleic acids having specific 
sequences of interest. See, Fodor et ah (1991) Science, 251: 767-777; Sheldon et ah 
(1993) Clinical Chemistry 39(4): 718-719, and Kozal et ah (1996) Natural Medicine 
5 2(7): 753-759. For example, oligonucleotides that hybridize to all known fused mu- 
tations can be synthesized on a DNA chip (such chips are available from Affy- 
metrix) and the nucleic acids from samples hybridized to the chip for simultaneous 
analysis of the sample nucleic acid for the presence or absence of any of the known 
fused mutations. Protocols for detecting mutations are also described in, for examp- 
1 0 le, Tijssen (1 993) Laboratory Techniques in biochemistry and molecular biology — 
hybridization with nucleic acid probes parts I and hi, Elsevier, new York, and Choo 
(ed) (1994) Methods In Molecular Biology Volume 33-In situ Hybridization Pro- 
tocols, Humana Press Inc., New Jersey (see also, other books in the Methods in 
Molecular Biology series). 

15 

v. Detection of expression levels. 

Where it is desired to quantify the transcription level (and thereby expression) of a 
normal or mutated fused genes in a sample, the nucleic acid sample is one in which 
the concentration of the rnRNA transcript(s) of the fused gene, or the concentration 

20 of the nucleic acids derived from the mRNA transcrip(s), is proportional to the tran- 
scription level (and therefore expression level) of that gene. Similarly, it is preferred 
that the hybridization signal intensity be proportional to the amount of hybridized 
nucleic acid. While it is preferred that the proportionality be relatively strict (e.g., a 
doubling in transcription rate results in a doubling in mRNA transcrip in the sample 

25 nucleic acid pool and a doubling in hybridization signal), one of skill will appreciate 
that the proportionality can be more relaxed and even non-linear. Thus, for example, 
an assay where a 5 fold difference in concentration of the target mRNA results in a 3 
to 6 fold difference in hybridization intensity is sufficient for most purposes. Where 
more precise quantification is required appropriate controls can be run to correct for 

30 variations introduced in sample preparation and hybridization as described herein. In 
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addition, serial dilutions of "standard" target mKNAs can be used to prepare calib- 
ration curves according to methods well known to those of skill in the art. Of course, 
where simple detection of the presence or absence of a transcrip is desired, no elabo- 
rate control or calibration is required. 

5 

C) Pol ypeptide assays. 

The expression of the human fused gene can also be detected and/or quantified by 
detecting or quantifying the expressed fused polypeptide. The fused polypeptides 
can be detected and quantified by any of a number of means well known to those of 

10 skill in the art. These may include analytic biochemical methods such as electropho- 
resis, capillary electrophoresis, high performance liquid chromatography (HPLC), 
thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, or 
various immunological methods such as fluid or gel precipitin reactions, immuno- 
diffusion (single or double), immunoelectrophoresis, radioimmunoassay(RIA), en- 

1 5 zyme-linked immunosorbent assays (ELIS As), immunofluorescent assays, Western 
blotting, and the like. 

In a particularly preferred ambodiment, the fused polypeptides are detected in an 
electrophoretic protein separation, more preferably in a two-dimensional electropho- 
resis, while in a most preferred embodiment, the fused polypeptides are detected 
20 using an immunoassay. 

As used herein, an immunoassay is an assay that utilizes an antibody to specifically 
bind to the analyte (fused polypeptide). The immunoassay is thus characterized by 
detection of specific binding of a fused polypeptide to an aati-fused antibody as op- 
25 posed to the use of other physical or chemical properties to isolate, target, and qu- 
antify the analyte. 

O Electrophoretic Assays! 

As indicated above, the presence or absence of fused polypeptides in a biological 
30 sample can be determined using electrophoretic methods. Means of detecting pro- 
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teins using electrophoretic techniques are well known to those of skill in the art (see 
generally, R. Scopes (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher, 
(1990) Methods in Enzymology Vol. J 82: Guide to Protein Purification., Academic 
Press, Inc., NY.). 

5 

2) Immunological Binding Assays. 

In a preferred embodiment, the fused polypeptides are detected and/or quantified 
using any of a number of well recognized immunological binding assays (see, e.g., 
U.S. Patent 4,366,241 ; 4,376,1 10; 4,517,288; and 4,837,168). For a review of the 

10 general immunoassays, see also Methods in Cell Biology Volume 37: antibodies in 
Cell Biology, Asain, ed. Academic Press, Inc. New York (1993); Basic and Clinical 
Immunology 7th Edition, Stites & Terr, eds. (1991). Immunological binding assays 
(or immunoassays) typically utilize a "capture agent" to specifically bind to and of- 
ten immobilize the analyte (in this case fused polypeptide or subsequence). The 

1 5 capture agent is a moiety that specifically binds to the analyte. In a preferred embo- 
diment, the capture agent is an antibody that specifically binds fused polypeptide(s). 
The antibody (anti-fused) may be produced by any of a number of means well 
known to those of skill in the art as described above in Section III(A). 

20 Immunoassays also often utilize a labeling agent to specifically bind to and label the 
binding complex formed by the capture agent and the analyte. The labeling agent 
may itself be one of the moieties comprising the antibody /analyte complex. Thus, 
the labeling agent may be a labeled fused polypeptide or a labeled anti-fused antibo- 
dy. Alternatively, the labeling agent may be a third moiety, such as another antibo- 

25 dy, that specifically binds to the antibody //used complex. 

In a preferred embodiment, the labeling agent is a second human fused antibody be- 
aring a label. Alternatively, the second fused antibody may lack a label, but it may, 
in turn, be bound by a labeled third antibody specific to antibodies of the species 
30 from which the second antibody is derived. The second can be modified with a de- 
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tectable moiety, such as biotin, to which a third labeled molecule can specifically 
bind, such as enzyme labeled streptavidin. 

Other proteins capable of specifically binding immunoglobulin constant regions, 
5 such as protein A or protein G, may also be used as the label agent. The protein are 
normal constituents of the cell walls of strepococcal bacteria. They exhibit a strong 
non-immunogenic reactivity with immunoglobulin constant regions from a variety 
of species (see, generally Kronval, et al (1973) J. Immunol, 111: 1401-1406, and 
Akerstrom, et al. (1985) J. Immunol, 135:2589-2542). 

10 

Throughout the assays, incubation and/or washing steps may be required after each 
combination of reagents. Incubation steps can vary from about 5 seconds to several 
hours, preferably from about 5 minutes to about 24 hours. However, the incubation 
time will depend upon the assay format, analyte, volume of solution, concentrations, 
15 and the like. Usually, the assays will be carried out at ambient temperature, although 
they can be conducted over a range of temperatures, such as 10°C to 40°C. 

a) Non-Competitive Assay Formats. 

Immunoassays fot detecting fused polypeptide mays be either competitive or non- 
20 competitive. Noncompetitive immunoassays are assays in which the amount of 
captured analyte (in this case fused) is directly measured. In one preferred 
"sandwich" assay, for example, the capture agent (<mti-fused antibodies) can be bo- 
und directly to a solid substrate where the are immobilized. These immobilized anti- 
bodies then capture fused present in the test sample. The fused thus immobilized in 
25 then bound by a labeling agent, such as a second human fused antibody bearing a 
label. Alternatively, the second fused antibody may lack a label, but it may, in turn, 
be bound by a labeled third antibody specific to antibodies of the species from 
which the second antibody is dervied. The second can be modified with a detectable 
moiety, such as biotin, to which a third labeled molecule can specifically bind, such 
30 as enzyme-labeled streptavidin. 
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b) Competitive assay formats. 

In competitive assays, the amount of analyte (fused) present in the sample is measu- 
red indirectly by measuring the amount of an added (exogenous) analyte {fused) 
5 displaced (or competed away) from a capture agent {anti-fused antibody) by the 
analyte present in the sample. In one competitive assay, a known amount of, in this 
case, fused is added to the sample and the sample is then contacted with a capture 
agent, in this case an antibody that specifically binds fused. The amount of fused bo- 
und to the antibody is inversely proportional to the concentration of fused present in 
10 the sample. 

In a particularly preferred embodiment, the antibody is immobilized on a solid sub- 
strate. The amount of fused bound to the antibody may be determined either by mea- 
suring the amount of fused present in a fused/antibody complex, or alternatively by 
1 5 measuring the amount of remaining uncomplexed fused. The amount of fused may 
be detected by providing a labeled fused molecule. 

A hapten inhibition assay is another preferred competivite assay. In this assay a 
known analyte, in this case fused, is immobilized on a solid substrate. A known 
amount of anti-fused antibody is added to the sample, and the sample is then con- 

20 tacted with the immobilized fused. In this case, the amount of anti-fused antibody 
bound to the immobilized fused is inversely proportional to the amount of fused pre- 
sent in the sample. Again the amount of immobilized antibody may be detected by 
detecting either the immobilized fraction of antibody or the fraction of the antibody 
that remains in solution. Detection may be direct where the antibody is labeled or 

25 indirect by the subsequent addition of a labeled moiety that specifically binds to the 
antobdy as described above. 

c) Other assay formats. " ~ 

In a particularly preferred embodiment, Western blot Immunoblot analysis is used to 

30 detect and quantify the presence of fused in the sample. The technique generally 
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comprises separating sample proteins by gel electrophoresis on the basis of mo- 
lecular weight, transferring the separated proteins to a suitable solid support, (such 
as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating 
the sample with the antibodies that specifically bind fused. The anti-fused antibodies 
5 specifically bind to fused on the solid support. These antibodies may be directly la- 
beled or alternatively may be subsequently detected using labeled antibodies (e.g., 
labeled sheep anti-mouse antibodies) that specifically bind to the anti-fused. 

Other assay formats include liposome immunoassays (LIA), which use liposomes 
10 designed to bind specific molecules (e.g., antibodies) and release encapsulated rea- 
gents or markers. The released chemicals are then detected according to standard 
techniques (see, Monroe et al. (\9&6) Amer. Clin. Prod. Rev. 5:34-41). 

d) Scorine of the assay. 

1 5 The assays of this invention are scored (as positive or negative for fused polypepti- 
de) according to standard methods well known to those of skill in the art. The parti- 
cular method of scoring will depend on the assay format and choice of label. For ex- 
ample, a Western Blot assay can be scored by visualizing the colored product pro- 
duced by the enzymatic label. A clearly visible colored band or spot at the correct 

20 molecular weight is scored as a positive result, while the absence of a clearly visible 
spot or band is scored as a negative. In a preferred embodiment, a positive test will 
show a signal intensity (e.g., fused polypeptide quantity) at least twice that of the 
background and/or control and more preferably at least 3 times or even at least 5 ti- 
mes greater than the background and/or negative control. 

25 

o) Reduction of non-specific binding. 

One of skill in the art will appreciate that it is often desirable to reduce non-specific 
binding in immunoassays. Particularly, where the assay involves an antigen or anti- 
body immobilized on a solid substrate it is desirable to minimize the amount of non- 
30 specific bindning to the substrate. Means of reducing such non-specific binding are 
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well known to those of skill in the art. Typically, this involves coating the substrate 
with a proteinaceous composition. In particular, protein compositions such as bovi- 
ne serum albumin (BSA), nonfat powdered milk, and gelatin are widely used with 
powdered milk being most preferred. 

5 

f) Labels. 

The particular label or detectable group used in the assay is not a critical aspect of 
the invention, so long as it does not significantly interfere with the specific binding 
of the antibody used in the assy. The detectable group can be any material having a 

10 detectable physical or chemical property. Such detectable labels have been well- 
developed in the field of immunoassays and, in general, most any label useful in 
such methods can be applied to the present invention. Thus, a label is any composi- 
tion detectable by spectroscopic, photochemical, biochemical, immunochemical, 
electrical, optical or chemical means. Useful labels in the present invention include 

1 5 magnetic beads (e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein isothiocya- 
nate, texas red, rhodamine, and the like), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), 
enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly 
used in an ELISA), and colorimetric labels such as colloidal gold or colored glass or 
plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. 

20 

The label may be coupled directly or indirectly to the desired component of the as- 
say according to methods well known in the art. As indicated above, a wide variety 
of labels may be used, with the choice of label depending on sensitivity required, 
ease of conjugation with the compound, stability requirements, available instru- 
25 mentation, and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a ligand 
molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds to 
an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or 
30 covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
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compound, or a chemilumtnescent compound. A number of ligands and anti-ligands 
can be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, 
and Cortisol, it can be used in conjunction with the labeled, naturally occurring anti- 
ligands. Alternatively, any haptenic or antigenic compound can be used in combina- 
5 tion with an antibody. 

The molecules can also be conjugated directly to signal generating compounds, e.g., 
by conjugation with an enzyme or fluorophore. Enzymes of interest as labels will 
primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or 

10 oxidoreductases, particularly peroxidases. Fluorescent compounds include fluore- 
scein and its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 
Chemiluminescent compounds include luciferin, and 2,3-dihydrophtalazinediones, 
e.g., luminol. For a review of various labeling or signal producing systems which 
may be used, see, U.S. Patent No. 4,391,904). 

1 5 Means of detecting labels are well known to those of skill in the art. Thus, for ex- 
ample, where the label is a radioactive label, means for detection include a scintilla- 
tion counter or photographic film as in autoradiography. Where the label is a fluore- 
scent label, it may be detected by exciting the fiuorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence. The fluorescence may 

20 be detected visually, by means of photographic film, by the use of electronic detec- 
tors such as charge coupled devices (CCDs) or photomultipliers and the like. Simi- 
larly, enzymatic labels may be detected by providing the appropriate substrates for 
the enzyme and detecting the resulting reaction product. Finally simple colorimetric 
labels may be detected simply by observing the color associated with the label. 

25 Thus, in various dipstick assays, conjugated gold often appears pink, while various 
conjugated beads appear the color of the bead. 

Some assay formats do riot require the use of labeled components. For instance, 
agglutination assays can be used to detect the presence of the target antibodies. In 
30 this case, antigen-coated particles are agglutinated by samples comprising the target 

SDOCID: <WO 9932609A1_I_> 



WO 99/32609 



53 



PCT/SE98/02384 



antibodies. In this format, none of the components need be labeled and the presence 
of the target antibody is detected by simple visual inspection. 

g) Substrates. 

5 As mentioned above, depending upon the assay, various components, including the 
antigen, target antibody, or anti-human antibody, may be bound to a solid surface. 
Many methods for immobilizing biomolecules to a variety of solid surfaces are 
known in the art. For instance, the solid surface may be a membrane (e.g., nitrocel- 
lulose), a mictrotiter dish (e.g., PVC, polypropylene, or polystyrene), a test tube 

1 0 (glass or plastic), a dipstick (e.g., glass, PVC, polypropylene, polystyrene, latex and 
the like), a microcentrifuge tube, or a glass or plastic bead. The desired component 
may be covalently bound or noncovalently attached through nonspecific bonding. 
A wide variety of organic and inorganic polymers, both natural and synthetic, may 
be employed as the material for the solid surface. Illustrative polymers include poly- 

15 ethylene, polypropylene, poly(4-methylbutene), polystyrene, polymethacrylate, po- 
lyethylene terephthalate), rayon, nylon, polyvinyl butyrate), polyvinylidene difluo- 
ride (PVDF), silicones, polyformaldehyde, cellulose, cellulose acetate, nitrocellulo- 
se, and the like. Other materials which may be employed include paper, glasses, ce- 
ramics, metals, metalloids, semiconductive materials, cements or the like. In addi- 

20 tion, are included substances that form gels, such as proteins (e.g., gelatins) Iipopo- 
lysaccharides, silicates, agarose and polyacrylamides can be used. Polymers which 
form several aqueous phases, such as dextrans, polyalkylene glycols or surfactants, 
such as phospholipids, long chain (12-24 carbon atoms) alkyl ammonium salts and 
the like are also suitable. Where the solid surface is porous, various pore sizes may 

25 be employed depending upon the nature of the system. 

In preparing the surface, a plurality of different materials may be employed, parti- 
cularly as laminates, to obtain [ various properties. For example, protein coatings, 
such as gelatin, can be used to avoid non-specific binding, simplify covalent conju- 
30 gation, enhance signal detection or the like. 
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If covalent bonding between a compound and the surface is desired, the surface will 
usually be polyfunctional or be capable of being polyfunctionalized. Functional 
groups which may be present on the surface and used for linking can include car- 
5 boxylic acids, aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl 
groups, mercapto groups and the like. The manner of linking a wide variety of com- 
pounds to various surfaces is well known and is amply illustrated in the literature. 
See, for example, Immobilized Enzymes, Ichiro Chibata, Halsted Press, New York, ' 
1978, and Cuatrecasas (1970) J. Biol. Chem. 245 3059). 

10 

In addition to covalent binding, various methods for noncovalently binding an assay 
component can be used. Noncovalent binding is typically nonspecific absorption of 
a compound to the surface. Typically, the surface is blocked with a second compo- 
und to prevent nonspecific binding of labeled assay components. Alternatively, the 
1 5 surface is designed such that is nonspecifically binds one component but does not 
significantly bind another. For example, a surface bearing a lectin such as Concana- 
valin A will bind a carbohydrate containing compound but not a labeled protein that 
lacks glycosylation. Various solid surfaces for use in noncovalent attachment of as- 
say components are reviewed in U.S. Patent Nos. 4,447,576 and 4,254,082. 

20 

D) Evaluation of fused expression levels and/or abnormal expression. 
One of skill will appreciate that abnormal expression levels or abnormal expression 
products (e.g., mutated transcripts, truncated or non-sense polypeptides) are identifi- 
ed by comparison to normal expression levels and normal expression products. 

25 Normal levels of expression or normal expression products can be determined for 
any particular population, subpopulation, or group of organisms according to stan- 
dard methods well known to those of skill in the art. Typically this involves identi- 
fying healthy organisms and measuring expression levels of the fused gene (as 
described herein) or sequencing the gene, mRNA, or reverse transcribed cDNA, to 

30 obtain typical (normal) sequence variations. Application of standard statistical met- 
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hods used in molecular genetics permits determination of baseline levels of expres- 
sion, and normal gene products as well as significant deviations from such baseline 
levels. 

5 E) Detection kits. 

The present invention also provides for kits for the diagnosis of organisms (e.g., 
patients). The kits preferably include one or more reagents for determining the pre- 
sence or absence of the fused gene, for quantifying expression of the fused gene, or 
for detecting an abnormal fused gene or expression products of an abnormal fused 

10 gene. Preferred reagents inlcude nucleic acid probes that specifically bind to the 
normal fused gene, cDNA, or subsequence thereof, probes that specifically bind to 
abnormal fused gene (e.g., fused containing premature truncations, insertions, or 
deletions), antibodies that specifically bind to normal fused polypeptides or sub- 
sequences thereof, or antibodies that specifically bind to abnormal fused polypepti- 

1 5 des or subsequences thereof. The antibody or hybridization probe may be free or 
immobilized on a solid support such as a test tube, a microtiter plate, a dipstick and 
the like. The kit may also contain instructional materials teaching the use of the an- 
tibody or hybridization probe in an assay for the detection of a deviation from the 
normal sequence of a fused gene. 

20 

The kits may include alternatively, or in combination with any of the other compo- 
nents described herein, an anti-fused antibody. The antibody can be monoclonal or 
polyclonal. The antibody can be conjugated to another moiety such as a label and/or 
it can be immobilized on a solid support (substrate). 

25 

The kits(s) may also contain a second antibody for detection of polypeptide/antibody 
complexes according to the invention or for detection of hybridized nucleic acid 
probes. The kit may contain appropriate reagents for detection of labels, positive and 
negative controls, washing solutions, dilution buffers and the like. 
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VT. Modulation of expression of endogenous fused genes. 

In still another emobidment, this invention provides methods of regulating the ex- 
pression of endogenous fused genes. The expression of a fused gene product may be 
increased. Regulation of fused gene may provide a convenient and controllable mo- 
5 del system for the study of the HH-PTC pathway, especially in human beings. 

Methods of altering the expression of endogenous genes are well known to those of 
skill in the art. Typically such methods involve altering or replacing all or a portion 
of the regulatory sequences controlling expression of the particular gene that is to be 
10 regulated. In a preferred embodiment, the regulatory sequences {e.g., the native 
promoter) upstream of the fused gene is altered. 

This is typically accomplished by the use of homologous recombination to introduce 
a heterologous nucleic acid into the native regulatory sequences. To downregulate 
expression the fused gene product, simple mutations that either alter the reading 
1 5 frame or disrupt the promoter are suitable. To upregulate expression of the fused ge- 
ne product, the native promoter(s) can be substituted with heterologous promoter(s) 
that induce higher than normal levels of transcription. 

In a particularly preferred embodiment, nucleic acid sequences comprising the 
20 structural gene in question or upstream sequences are utilized for targeting hetero- 
logous recombination constructs. Upstream and downstream sequences can be rea- 
dily determined using the information provided herein. Such sequences, for examp- 
le, can be extended using 5'- or 3'-RACE and homologous recombination constructs 
created with only routine experimentation. 

25 

The use of homologous recombination to alter expression of endogenous genes is 
described in detail in U.S. Patent 5,272,071, WO 91/09955, WO 93/09222, WO 
96/2941 1, WO 95/31560, and WO 91/12650. 

30 
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VII. Therapeutics 

A) Pharmaceutical Compositions 

The proteins, polypeptides, antibodies and mti-fused antibody-effector (e.g., enzy- 
me toxin, hormone, growth factor, drug, etc.) conjugates or fusion proteins of this 
5 invention are useful for parenteral, topical, oral, or local administration, such as by 
aerosol or transdermally, for prophylactic and/or therapeutic treatment. The pharma- 
ceutical compositions can be administered in a variety of unit dosage forms depen- 
ding upon the method of administration. For example, unit dosage forms suitable for 
oral administration include powder, tablets, pills, capsules and lozenges. It is recog- 

1 0 nized that the polypeptides and related compounds decribed of, when administered 
orally, must be protected from digestion. This is typically accomplished either by 
complexing the protein with a composition to render it resistant to acidic and enzy- 
matic hydrolysis or by packaging the protein in an appropriately resistant carrier 
such as a liposome. Means of protecting proteins from digestion are well known in 

15 the art. 

The pharmaceutical compositions of this invention may be used for topical administ- 
ration, e.g. to treat basal cell carcinomas, or their precursors, solar keratoses. In 
another embodiment, the compositions are useful for parenteral administration, such 
as intravenous administration or administration into a body cavity or lumen of an 

20 organ. The compositions for administration will commonly comprise a solution of 
the polypeptide, antibody or antibody chimera/fusion dissolved in a pharmaceuti- 
cally acceptable carrier, preferably an aqueous carrier. A variety of aqueous carriers 
can be used, e.g., buffered saline and the like. These solutions are sterile and gene- 
rally free of undesirable matter. These compositions may be sterilized by conventio- 

25 nal, well known sterilization techniques. The compositions may contain pharmaceu- 
tically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and 
the like, for example," sodium acetate,' sodium chloride, potassium chloride, 'calcium" 
chloride, sodium lactate and the like. The concentration of chimeric molecule in the- 

30 se formulations can vary widely, and will be selected primarily based on fluid volu- 
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mes, viscosities, body weight and the like in accordance with the particular mode of 
administration selected and the patient's needs. 

Thus, a typical pharmaceutical composition for intravenous administration would be 
5 about 0. 1 to 1 0 mg per patient per day. Dosages from 0.1 up to about 1 00 mg per 
patient per day may be used, particularly when the drug is administered to a seclu- 
ded site and not into the blood stream, such as into a body cavity or into a lumen of 
an organ. Substantially higher dosages are possible in topical administration. Actual 
methods for preparing parenterally administrable compositions will be known or ap- 
10 parent to those skilled in the art and are described in more detail in such pulications 
as Remington 's Pharmaceutical Science, 15 th ., Mack Publishing Company, Easton, 
Pennsylvania (1980). 

The compositions containing the present polypeptides, antibodies or antibody chi- 
1 5 mer/fusion, or a cocktail therof (i.e., with other proteins), can be administered for 
therapeutic treatments. In therapeutic applications, compositions are administered to 
a patient suffering from a disease in an amount sufficient to cure or at least partially 
arrest the disease and its complications. An amount adequate to accomplish this is 
defined as a "therapeutically effective dose." Amounts effective for this use will de- 
20 pend upon the severity of the disease and the general state of the patient's health. 

Single or multiple administrations of the compositions may be administered depen- 
ding on the dosage and frequency as required and tolerated by the patient. In any 
event, the composition should provide a sufficient quantity of the proteins of this in- 
25 vention to effectively treat the patient. 

Cellular Transformation and Gene Therapy. 
The present invention provides packageable human fused nucleic acids (cDNAs) for 
the transformation of cells in vitro and in vivo. These packageable nucleic acids can 
30 be inserted into any of a number of well known vectors for the transfection and 
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transformation of target cells and organisms as described below. The nucleic acids 
are transfected into cells, ex vivo or in vivo, through the interaction of the vector and 
the target cell. The fused cDNA, under the control of a promoter, then expresses the 
fused protein thereby mitigating the effects of absent fused genes or partial inactiva- 
5 tion of the fused gene or abnormal expression of the fused gene. 

Such gene therapy procedures have been used to correct acquired and inherited ge- 
netic defects, cancer, and viral infection in a number of contexts. The ability to arti- 
fical genes in humans facilitates the prevention and/or cure of many important hu- 

10 man diseases, including many diseases which are not amenable to treatment by other 
therapies. As an example, in vivo expression of cholesterol-regulating genes, genes 
which selectively block the replication of HTV, and tumor-suppressing genes in hu- 
man patients dramatically improves the treatment of heart disease, AIDS, and can- 
cer, respectively. For a review of gene therapy procedures, see Anderson, Science 

15 (1992) 256:808-813; Nabel and Feigner (1993) TIBTECH 1 1 : 211-217; Mitani and 
Caskey (1993) TIBTECH 11: 162-166; Mulligan (1993) Science 926-932; Dillon 
(1993) TIBTECH 1 1 : 167-175; Miller (1992) Nature 357: 455-460; Van Brunt 
(1988) Biotechnology 6(10): 1 149-1 154; Vigne (1995) Restorative Neurology and 
Neuroscience 8: 35-36; Kremer and Perricaudet (1995) British Medical Bulletin 

20 51(1) 31-44; Haddada et al. (1995) in Current Topics in Microbiology and Immu- 
nology Doerfler and Bohm (eds) Springer- Verlag, Heidelberg Germany; and Yu et 
al, Gene Theraphy{\994) 1:13-26. 



Delivery of the gene or genetic material into the cell is the first critical step in gene 
25 therapy treatment of disease. A large number of delivery methods are well known to 
those of skill in the art. Such methods include, for example liposome-based gene 
delivery (Debs and Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) 
BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; Brigham (1991) WO 
91/06309; and Feigner et al. (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414), and 
30 replication-defective retroviral vectors harboring a therapeutic polynucleotide 
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sequence as part of the retroviral genome (see, e.g., Miller et al (1990) Mol Cell. 
Biol. 10:4239 (1990; Kolberg (1992) J. NIHRes. 4:43, and Cornetta et al. Hum. 
Gene Ther. 2:215 (1991)). Widely used retroviral vectors include those based upon 
murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Inraiu- 

5 no deficiency virus (SIV), human immuno deficiency virus (HTV), and combinations 
thereof. See, e.g., Buchscher et al. (1992) J. Virol. 66 (5) 2731-2739; Johann et al. 
(1992) J. Virol. 66 (5):1635-1640 (1992); Sommerfelt et al, (1990) Virol. 176:58- 
59; Wilson et al. (1989) J. Virol. 63:2374-2378; Miller et al, J Virol. 65:2220-2224 
(1991); Wong-Staal et al, PCT/US94/05700, and Rosenburg and Fauci (1993) in 

1 0 Fundamental Immunology, Third Edition Paul (ed) Raven Press, Ltd., New York 
and the references therein, and Yu et al, Gene Therapy (1 994) supra). 

AAV-based vectors are also used to transduce cells with target nucleic acids, e.g., in 
the in vitro production of nucleic acids and peptides, and in in vivo and ex vivo gene 

1 5 therapy procedures. See, West et al. ( 1 987) Virology 1 60:38-47; Carter et al. ( 1 989) 
U.S. Patent No. 4.797,368; Carter et al. WO 93/24641 (1993); Kotin (1994) Human 
Gene Therapy 5:793-801; Myzyczka (1994) J. Clin. Invest. 94:1351 and Samulski 
{supra) for an overview of AAV vectors. Construction of recombinant AAV vectors 
are described in a number of pulbications, including Lebkowski, U.S. Pat. No. 

20 5,173,414; Tratschin et al (1985) Mol. Cell Biol 5(1 1):3251-3260; Tratschin, et al. 
(1984) Mol. Cell. Biol. 4:2072-2081 ; Hermonat and Muzyczka (1984) Proc. Natl 
Acad. Sci. USA 81:6466-6470; McLaughlin et al. (1988) and Samulski et al (1989) 
J. Virol. 63:03822-3828. Cell lines that can be transformed by rAAV include those 
described in Lebkowski et al. (1988) Mol Cell. Biol. 8:3988-3996. 

25 

vm. 

A) Ex vivo transformation of cells. 

Ex vivo cell transformation "for diagnostics, reserach, or for gene therapy (e.g., via 
re-infusion of the transformed cells into the host organism) is well known to those of 
30 skill in the art. In a preferred embodiment, cells are isolated from the subject orga- 
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nism, transfected with the fused gene or cDNA of this invention, and re-infused 
back into the subject organism {e.g., patient). Various cell types suitable for ex vivo 
transformation are well known to those of skill in the art. Paricular preferred cells 
are progenitor or stem cells (see, e.g., Freshney et al.,m Culture of Animal Cells, a 
5 Manual of Basic Technique, third edition Wiley-Liss, New York ( 1 994)) and the 
references cited therein for a discussion of how to isolate and culture cells from pa- 
tients). 

As indicated above, in a preferred embodiment, the packageable nucleic acid enco- 
1 0 des a fused polypeptide under the control of an activated or constitutive promoter. 

In one particularly preferred embodiment, stem cells are used in ex-vivo procedures 
for cell transformation and gene therapy. The advantage to using stem cells is that 
they can be differentiated into other cell types in vitro, or can be introduced into a 
1 5 mammal (such as the donor of the cells) where they will engraft in the bone marrow. 
Methods for differentiating CD34 + cells in vitro into clinically important immune 
cell types using cytokines such a GM-CSF, IFN-y and TNF-a are known (see, Inaba 
et al. (1992) J. Exp. Med. 176: 1693-1702, and Szabolcs et al. (1995) 154: 5851- 
5861). 

20 

Stem cells are isolated for transduction and differentiation using known methods. 
For example, in mice, bone marrow cells are isolated by sacrificing the mouse and 
cutting the leg bones with a pair of scissors. Stem cells are isolated from bone mar- 
row cells by panning the bone marrow ceils with antibodies which bind unwanted 
25 cells, such as CD4 + and CD8 + (T cells), CD45 + (panB cells), GR- 1 (granulocytes), 
and Ia d (defferentiated antigen presenting cells). For an example of this protocol see, 
Inaba et al. (1992) J. Exp. Med. 176: 1693-1702. 

In humans, bone marrow aspirations from iliac crests are performed e.g., under ge- 
30 neral anesthesia in the operating room. The bone marrow aspirations is approxima- 
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tely 1,000 ml in quantity and is collected from the posterior iliac bones and crests. If 
the total number of cells collected is less than about 2 x 1 0 8 /kg, a second aspiration 
using the sternum and anterior iliac crests in addition to posterior crests is perfor- 
med. During the operation, two units of irradiated packed red cells are administered 

5 to replace the volume of marrow taken by the aspiration. Human hematopoietic pro- 
genitor and stem cells are characterized by the presence of a CD34 surface membra- 
ne antigen. This antigen is used for purification, e.g., on affinity columns which bind 
CD34. After the bone marrow is harvested, the mononuclear cells are separated 
from the other components by means of ficoll gradient centrifugation. This is per- 

1 0 formed by a semi-automated method using a cell separator (e.g., a Baxter Fenwal 
CS3000+ or Terumo machine). The light density cells, composed mostly of mono- 
nuclear cells are collected and the cells are incubated in plastic flasks at 37°C for 1 .5 
hours. The adherent cells (monocytes, macrophages and B-Cells) are discarded. The 
non-adherent cells are then collected and incubated with a monoclonal anti-CD34 

1 5 antibody (e.g., the murine antibody 9C5) at 4°C for 30 minutes with gentle rotation. 
The final concentration for the anti-CD34 antibody is 10 jig/ml. After two washes, 
paramagnetic microspheres (DynaBeads, supplied by Baxter Immunotherapy Group, 
Santa Ana, California) coated with sheep antimouse IgG (Fc) antibody are added to 
the cell suspension at a ratio of 2 cells/bead. After a further incubation period of 30 

20 minuter at 4°C, the resetted cells with magnetic beads are collected with a magnet. 
Chymopapain (supplied by Baxter Immunotherapy Group, Santa Ana, California) at 
a final concentration 200 U/ml is added to release the beads from the CD34+ cells. 
Alternatively, and preferably, an affinity column isolation procedure can be used 
which binds to CD34, or to antibodies bound to CD34 (see, the examples below). 

25 See, Ho et al. ( 1 995) Stem Cells 1 3 Usuppl. 3): 1 00- 1 05 . See also, Brenner ( 1 993) 
Journal of hematotherapy 2: 7-17. 

In another embodiment, hematopoietic stem cells are isolated from fetal cord blood. 
Yu et al. (1995) Proc. Natl. Acad. Sci. USA 92: 699-703 describe a preferred met- 
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hod of transducing CD34 + cells from human fetal cord blood using retroviral vec- 
tors. 

B) In vivo transformation 

Vectors (e.g., retroviruses, adenoviruses, liposomes, etc.) containing therapeutic 
nucleic acids can be administered directly to the organism for transduction of cells 
in vivo. Administration is by any of the routes normally used for introducing a mo- 
lecule into ultimate contact with blood or tissue cells. The packaged nucleic acids 
are administered in any suitable manner, preferably with pharmaceutically accep- 
table carriers. Suitable methods of administering such packaged nucleic acids are 
available and well known to those of skill in the art, and, although more than one 
route can be used to administer a particular composition, a particular route can often 
provide a more immediate and more effective reaction than another route. 

Pharmaceutically acceptable carriers are determined in part by the particular compo- 
sition being administered, as well as by the particular method used to administer the 
composition. Accordingly, there is a wide variety of suitable formulations of phar- 
maceutical compositions of the present invention. 

Formulations suitable for oral administration can consist of (a) liquid solutions, such 
as an effective amount of the packagecd nucleic acid suspended in diluents, such as 
water, saline or PEG 400; (b) capsules, sachets or tablets, each containing a prede- 
termined amount of the active ingredient, as liquids, solids, granules or gelatin; (c) 
suspensions in an appropriate liquid; and (d) suitable emulsions. Tablet forms can 
include one or more of lactose, sucrose, mannitol, sorbitol, calcium phosphates, corn 
starch, potato starch, tragacanth, microcrystalline cellulose, acacia, gelatin, colloidal 
silicon dioxide, croscarmellose sodium, talc, magnesium stearate, stearic acid, and 
other excipiehts, colorants, fillers; binders; diluents, buffering agents; moistening" 
agents, preservatives, flavoring agents, dyes, disintegrating agents, and pharmaceu- 
tically compatible carriers. Lozenge forms can comprise the active ingredient in a 
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flavor, usually sucrose and acacia or tragacanth, as well as pastilles comprising the 
active ingredient in an inert base, such as gelatin and glycerin or sucrose and acacia 
emulsions, gels, and the like containing, in addition to the active ingredient, carriers 
known in the art. 

5 

The packaged nucleic acids, alone or in combination with other suitable compo- 
nents, can be made into aerosol formulations {i.e., they can be "nebulized") to be 
administered via inhalation. Aerosol formulations can be placed into pressurized ac- 
ceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the 
10 like. 

Suitable formulations for rectal administration include, for example, suppositiories, 
which consist of the packaged nucleic acid with a suppository base. Suitable suppo- 
sitory bases include natural or synthetic triglycerides or paraffin hydrocarbons. In 
1 5 addition, it is also possible to use gelatin rectal capsules which consist of a combi- 
nation of the packaged nucleic acid with a base, including, for example, liquid trig- 
lycerides, polyethylene glycols, and paraffin hydrocarbons. 

Formulations suitable for parenteral administration, such as, for example, by intra- 
articular (in the joints), intravenous, intramuscular, intradermal, intraperitoneal, and 

20 subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection 
solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that 
render the formulation isotonic with the blood of the intended recipient, and aqueous 
and non-aqueous sterile suspensions that can include suspending agents, solubili- 
zers, thickening agents, stabilizers, and preservatives. In the practice of this inven- 

25 tion, compositions can be administered, for example, by intravenous infusion, orally, 
topically, intraperitoneally, intravesically or intrathecal ly. Parenteral administration 
and intravenous administration are the preferred methods of administration. The 
formulations of packaged nucleic acid can be presented in unit-dose or multi-dose 
sealed containers, such as ampules and vials. 

30 
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Injection solutions and suspensions can be prepared from sterile powders, granules, 
and tablets of the kind previously described. Cells transduced by the packaged nuc- 
leic acid as decribed above in the context of ex vivo therapy can also be administe- 
red intravenously or parenterally as described above. 

5 

The dose administered to a patient, in the context of the present invention should be 
sufficient to effect a beneficial therapeutic response in the patient over time. The do- 
se will be determined by the efficacy of the particular vector employed and the con- 
dition of the patient, as well as the body weight or surface area of the patient to be 
1 0 treated. The size of the dose also will be determined by the existence, nature, and 
extent of any adverse side-effects that accompany the administration of a particular 
vector, or transduced cell type in a particular patient. 

For administration, inhibitors and transduced cells of the present invention can be 
1 5 administered at a rate determined by the LD-50 of the inhibitor, vector, or transdu- 
ced cell type, and the side-effects of the inhibitor, vector or cell type at various con- 
centrations, as applied to the mass and overall health of the patient. Administration 
can be accomplished via single or divided doses. 

In a preferred embodiment, prior to infusion, blood samples are obtained and saved 
20 for analysis. Between 1 X 10 8 and IX 10 12 transduced cells are infused intrave- 
nously over 60-200 minutes. Vital signs and oxygen saturation by pulse oximetry 
are closely monitored. Blood samples are obtained 5 minutes and 1 hour following 
infusion and saved for subsequent analysis. Leukopheresis, transduction and reinfu- 
sion can be repeated are repeated every 2 to 3 months. After the first treatment, infu- 
25 sions can be performed on a outpatient basis at the discretion of the clinician. If the 
reinfusions is given as an outpatient, the participant is monitored for at least 4, and 
preferably 8 hours following the therapy. 

Transduced cells are prepared for reinfusion according to established methods. See, 
30 Abrahamsen et al. ( 1 99 1 ) J. Clin. Apheresis, 6: 48-53 ; Carter et al. ( 1 988) J. Clin. 
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Apheresis, 4:1 13-1 17; Aebersold et al (1988) J. Immunol. Meth., 1 12: 1-7; Muul et 
al. (1987) J. Immunol. Methods 101:171-181 and Carter*?/ al. (1987) Transfusion 
27: 362-365. After a period of about 2-4 weeks in culture, the cells should number 
between 1 X 10 8 and 1 X 10 12 . In this regard, the growth characteristics of cells vary 
5 from patient to patient and from cell type to cell type. About 72 hours prior to rein- 
fusion of the transduced cells, an aliquot is taken for analysis of phenotype, and per- 
centage of cells expressing the therapeutic agent. 

EXAMPLES 

1 0 The following examples are offered to illustrate, but not to limit the present inven- 
tion. 

Cloning of a human Fused homolog. 

By searching the EST database a partial cDNA clone (GenBank Acc: F07835) was 
1 5 identified having significant sequence identity with the Drosophila Fused. Based on 
the sequence of this clone. PCR primers were designed in order to use the RACE 
technology to isolate the missing sequence of the human homolog (Zaphiropoulos 
and Toftgard, DNA Cell Biology, 15, 1049-1056,1996). The PCR primers used to 
extend the sequence in the 5' direction from epidermal cDNA were 5' AGA TGC 
20 GAC GGC TGG GGT C and 5' GCG TCG ACT GCA GTA GGT CCC GGC AGT 
CTC (nested primer). The PCR primers used to extend the sequence in the 3' direc- 
tion from epidermal cDNA were 5' TCT TCG GGC AGC CCC CCT T and 5' GCG 
TCG ACG TTC TCG GAG CTG GAA GAG AAG AT (nested primer). 

25 The products of the RACE amplification were cloned into the pGEM 5 vector 

(Promega) and sequenced with dye-didioxy nucleotides using the facilities of Cy- 
bergene AB. 



Two additional splice variants of the human fused homolog were also identified. 
30 Variant I had 4 extra nucleotides (GTAG) between positions 124 and 125, and 
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splice variant II lacked the 7 nucelotides (GTGTGCA) at positions 1 18 to 124. In 
these variant 5' sequences there are no methionine codons in frame with the remai- 
ning fused sequence, but instead termination codons, suggesting that a downstream 
methionine or non-methionine codons are likely to nbe used for translation initia- 
5 tions. 

Cloning of a human fused homolog. 

By searching the EST database a partial cDNA clone (GenBank Acc: F07835) was 
identified having significant sequence identity with the drosophila fused. Based on 

1 0 the sequence of this clone, PCR primers were designed in order to use the RACE 
technology to isolate the missing sequence of the human homolog (Zaphiropoulos 
and Toftgard, DNA Cell Biology, 15, 1049-1056, 19969). The PCR primers used to 
extend the sequence in the 5' direction from epidermal cDNA were 5' AGA TGC 
GAC GGC TGG GGT C and 5' GCG TCG ACT GCA GTA GGT CCC GGC AGT 

15 CTC (nested primer=. The PCR primers used to extend the sequence in the 3' direc- 
tion from epidermal cDNA were 5' TCT TCG GGC AGC CCC CCT T and 5' GCG 
TCG ACG TTC TCG GAG CTG GAA GAG AAG AT (nested primer). 
The product of the RACE amplification were cloned into the pGEM 5 vector 
(Promega) and sequenced with dye-dideoxy necleotides using the facilities of Cy- 

20 bergene AB. 

Two additional splice variants of the human fused homolog were also identified. 
Variant I had 4 extra nucleotides (GTAG) between positions 124 and 125, and 
splice variant II lacked the 7 nucleotides (GTGTGCA) at positions 1 18 to 124. In 
25 these variant 5' sequences there are no methionine codons in frame with the remai- 
ning fused sequence, but instead termination codons, suggesting that a downstream 
methionine or non-methionine codons are likely to be used for translation initiation. 

The human fused homolog the the following properties: 
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a. 33,5% amino acid identity with Drosophila fused (within the kinase domain the 
identity increases to 37,0%). 

b. 54,0% amino acid similarity with Drosophila fused (within the kinase domain the 
similarity increases to 59,9%). 

5 c. Detected mRNA expresion in epidermis, pancreas, kidney, skeletal muscle, liver, 
placeta brain, with highest levels in lung. 
4. Observed evolutionary conservation in other species including monkey, rat, mou- 
se, dog and cow. 
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CLAIMS 

1. An isolated protein molecule capable of involvement in eliciting an intracellular 
signal in the human HH-PTC pathway and exhibiting a substantial similarity with 
the sequence disclosed in SEQ ED NO 2. 

5 

2. A protein according to claim 1, which exhibits a sequence similarity of about 80% 
with the sequence disclosed in SEQ ID NO 2. 

3. A protein according to claim 1 or 2, which is comprised of the amino acid 

1 0 sequence disclosed in SEQ ID NO 2, or an analogue or functional equivalent there- 
of. 

4. An isolated polypeptide capable of involvement in eliciting an intracellular signal 
in the human HH-PTC pathway and comprised of a subsequence of about 8-15 con- 

1 5 tiguous amino acids of the sequence disclosed in SEQ ID NO 2. 

5. A polypeptide according to claim 4 comprised of 10 contiguous amino acids of 
the sequence disclosed in SEQ ID NO 2. 

20 6. A polypeptide according to claim 4 or 5 for use as a medicament. 

7. A substance capable of activating or inhibiting a protein according to any one of 
claims 1-3 or a polypeptide according to claim 4 or 5, which substance is characteri- 
zed in the Kinase-regulating property thereof. 

25 

8. A substance according to claim 7 for use as a medicament. 

9. A DNA sequence encoding a protein according to any one of claims 1-3; or a va- 
riant thereof. 
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10. A DNA sequence encoding a polypeptide according to claim 4 or 5, or a variant 
thereof. 

1 1 . An isolated human DNA sequence capable of involvment in eliciting an intra- 

5 cellular signal in the HH-PTCH pathway comprising essentially all of the sequence 
as disclosed in SEQ ID NO 1 . 

12. A vector comprising such a DNA sequence as defined in any one of claims 9-11. 

10 13. An expression cassette comprising such a DNA sequence as defined in any one 
of claims 9-11. 

14. A cell comprising such an expression cassette as defined in claim 13 or a vector 
according to claim 12. 

15 

15. An antibody which specifically binds to such a polypeptide as defined in claim 4 
or 5. 

16. An antibody according to claim 15, which is a monoclonal antibody. 

20 

17. A recombinant cell expressing such an antibody as defined in claim 15 or 16. 

1 8. An antibody according to claim 1 5 or 1 6 for use as a medicament. 

25 1 9. A pharmaceutical composition which comprises a polypeptide according to 
claim 4 or 5 and a pharmaceutically acceptable carrier. 

20. A kit for detection of a human fused gene, which kit comprises a nucleic acid 
sequence capable of hybridising specifically with the sequence disclosed in SEQ ID 
30 NOl. 
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21 . A kit for detection of a protein encoded by a human fused gene, comprising a 
container containing such an antibody as defined in claim 15 or 16. 
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(1 ) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Karolinska Innovations AB 

(B) STREET: Karolinska Institutet 

(C) CITY: Stockholm 

(E) COUNTRY: SWEDEN 

(F) POSTAL CODE (ZIP): 1 71 77 

(ii) TITLE OF INVENTION: FUSED GENE 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:90..1533 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 
CTGGGGTCCC CCGCGCCTGG ACGGCTTCAT CCTCACCGAG CGCCTGGGCA GCGGCACGTA 60 

CGCCACGGTG TACAAGGCCT ACGCCAAGA ATG CAA AGA AAT GGA TCA GCT AGC 1 1 3 

Met Gin Arg Asn Gly Ser Ala Ser 
1 5 

CGA GGG TTG GAG AAG ACC AGG CTG AGG CTG TGT CGG GAA GCC AGG ATT 1 61 

Arg Gly Leu Glu Lys Thr Arg Leu Arg Leu Cys Arg Glu Ala Arg lie 
10 15 20 

CCA GAA TCA GCA TTC CTC ACT GGC CTC ACA AGG GAA AGC TGG GAA GCC 209 
Pro Glu Ser Ala Phe Leu Thr Gly Leu Thr Arg Glu Ser Trp Glu Ala 
25 30 35 40 
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CGG TGC TGG TGT GCA AAG GAC ACT CGT GAA GTG GTA GCC ATA AAG TGT 257 
Arg Cys Tip Cys Ala Lys Asp Thr Arg Glu Val Val Ala He Lys Cys 
45 50 55 

GTA GCC AAG AAA AGT CTG AAC AAG GCA TCG GTG GAG AAC CTC CTC ACG 305 
Val Ala Lys Lys Ser Leu Asn Lys Ala Ser Val Glu Asn Leu Leu Thr 
60 65 70 

GAG ATT GAG ATC CTC AAG GGC ATT CGA CAT CCC CAC ATT GTG CAG CTG 353 
Glu He Glu He Leu Lys Gly He Arg His Pro His He Val Gin Leu 
75 80 85 

AAA GAC TTT CAG TGG GAC AGT GAC AAT ATC TAC CTC ATC ATG GAG TTT 401 
Lys Asp Phe Gin Trp Asp Ser Asp Asn lie Tyr Leu lie Met Glu Phe 
90 95 100 

TGC GCA GGG GGC GAC CTG TCT CGC TTC ATC CAT ACC CGC AGG ATT CTG 449 
Cys Ala Gly Gly Asp Leu Ser Arg Phe lie His Thr Arg Arg He Leu 
105 110 115 120 

CCT GAG AAG GTG GCG CGT GTC TTC ATG CAG CAA TTA GCT AGC GCC CTG 497 
Pro Glu Lys Val Ala Arg Val Phe Met Gin Gin Leu Ala Ser Ala Leu 
125 130 135 

CAA TTC CTG CAT GAA CGG AAT ATC TCT CAC CTG GAT CTG AAG CCA CAG 545 
Gin Phe Leu His Glu Arg Asn He Ser His Leu Asp Leu Lys Pro Gin 
140 145 150 

AAC ATT CTA CTG AGC TCC TTG GAG AAG CCC CAC CTA AAA CTG GCA GAC 593 
Asn lie Leu Leu Ser Ser Leu Glu Lys Pro His Leu Lys Leu Ala Asp 
155 160 165 

TTT GGT TTC GCA CAA CAC ATG TCC CCG TGG GAT GAG AAG CAC GTG CTC 641 
Phe Gly Phe Ala Gin His Met Ser Pro Trp Asp Glu Lys His Val Leu 
170 175 180 

CGT GGC TCC CCC CTC TAC ATG GCC CCC GAG ATG GTG TGC CAG CGG CAG 689 
Arg Gly Ser Pro Leu Tyr Met Ala Pro Glu Met Val Cys Gin Arg Gin 
185 190 195 200 

TAT GAC GCC CGC GTG GAC CTC TGG TCC ATG GGG GTC ATC CTG TAT GAA 737 
Tyr Asp Ala Arg Val Asp Leu Trp Ser Met Gly Val He Leu Tyr Glu 
205 210 215 

GCC CTC TTC GGG CAG CCC CCC TTT GCC TCC AGG TCG TTC TCG GAG CTG 785 
Ala Leu Phe Gly Gin Pro Pro Phe Ala Ser Arg Ser Phe Ser Glu Leu 
220 225 230 

GAA GAG AAG ATC CGT AGC AAC CGG GTC ATC GAG CTC CCC TTG CGG CCC 833 
Glu Glu Lys He Arg Ser Asn Arg Val He Glu Leu Pro Leu Arg Pro 
235 240 245 

CTG CTC TCC CGA GAC TGC CGG GAC CTA CAG CAG CGG CTC CTG GAG CGG 881 
Leu Leu Ser Arg Asp Cys Arg Asp Leu Gin Gin Arg Leu Leu Glu Arg 
250 255 260 



GAC CCC AGC CGT CGC ATC TCC TTC CAG GAC TTC TTT GCG CAC CCC TGG 
Asp Pro Ser Arg Arg He Ser Phe Gin Asp Phe Phe Ala His Pro Trp 
265 270 275 280 
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GTG GAC CTG GAG CAC ATG CCC AGT GGG GAG AGT CTG GGG CGA GCA ACC 977 
Val Asp Leu Glu His Met Pro Ser Gly Glu Ser Leu Gly Arg Ala Thr 
285 290 295 



GCC CTG GTG GTG CAG GCT GTG AAG AAA GAC CAG GAG GGG GAT TCA GCA 
Ala Leu Val Val Gin Ala Val Lys Lys Asp Gin Glu Gly Asp Ser Ala 
300 305 310 

GCC GCC TTA TCA CTC TAC TGC AAG GCT CTG GAC TTC TTT GTA CCT GCC 
Ala Ala Leu Ser Leu Tyr Cys Lys Ala Leu Asp Phe Phe Val Pro Ala 
315 320 325 



1025 



1073 



CTG CAC TAT G AA GTG GAT GCC CAG CGG AAG GAG GCA ATT AAG GCA AAG 1121 
Leu His Tyr Glu Val Asp Ala Gin Arg Lys Glu Ala He Lys Ala Lys 
330 335 340 

GTG GGG CAG TAC GTG TCC CGG GCT GAG GAG CTC AAG GCC ATC GTC TCC 1 1 69 

Val Gly Gin Tyr Val Ser Arg Ala Glu Glu Leu Lys Ala He Val Ser 
345 350 355 360 

TCT TCC AAT CAG GCC CTG CTG AGG CAG GGG ACC TCT GCC CGA GAC CTG 1217 
Ser Ser Asn Gin Ala Leu Leu Arg Gin Gly Thr Ser Ala Arg Asp Leu 
365 370 375 

CTC AGA GAG ATG GCC CGG GAC AAG CCA CGC CTC CTA GCT GCC CTG GAA 1265 
Leu Arg Glu Met Ala Arg Asp Lys Pro Arg Leu Leu Ala Ala Leu Glu 
380 385 390 

GTG GCT TCA GCT GCC ATG GCC AAG GAG GAG GCC GCC GGC GGG GAG CAG 1313 
Val Ala Ser Ala Ala Met Ala Lys Glu Glu Ala Ala Gly Gly Glu Gin 
395 400 405 

GAT GCC CTG GAC CTG TAC CAG CAC AGC CTG GGG GAG CTA CTG CTG TTG 1 361 

Asp Ala Leu Asp Leu Tyr Gin His Ser Leu Gly Glu Leu Leu Leu Leu 
410 " 415 420 

CTG GCA GCG GAG CCC CCG GGC CGG AGG CGG GAG CTG CTT CAC ACT GAG 1 409 

Leu Ala Ala Glu Pro Pro Gly Arg Arg Arg Glu Leu Leu His Thr Glu 
425 430 435 440 

GTT CAG AAC CTC ATG GCC CGA GCT GAA TAC TTG AAG GAG CAG ATG AGG 1457 
Val Gin Asn Leu Met Ala Arg Ala Glu Tyr Leu Lys Glu Gin Met Arg 
445 450 455 

GAA TCT CGC TGG GAA GCT GAC ACC CTG GAC AAA GAG GGA CTG TCG GAA 1 505 

Glu Ser Arg Trp Glu Ala Asp Thr Leu Asp Lys Glu Gly Leu Ser Gitt 
460 465 470 

TCT GTT CGT AGC TCT TGC ACC CTT CAG T GACCCTAGAA GAATGATTGG 1 553 

Ser Val Arg Ser Ser Cys Thr Leu Gin 
475 480 

ACAGATGTGA GCCATCTGGA GCAGAGGGGC ACTAACCCAG GCTGACGCCA AGAATGAAGT 1613 

GGCCCACTGC AGCCCTGGCG AGCAGGCTTC TTGGATGGAC AGTGCTGAGA CCCCCATATC 1 673 

CCAGAGTCCC CAGCCTCCCT CAGGTTACTC TGCACCCCAC AGATGGTTTG ATGGCTGTGC 1 733 
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TGTATACTGG AGGGGAGGGC AGGACTCTGG GAGAACAGCA CTTCTTTCAT GAGACCTTTG 1 793 
TTACTCGGTG GTTACTGGGT CCTGTGCCTG TCCGTTTTGG GGCATGCAGC CCTCTATCAT 1 853 
TTTTGGCTCC G AG AAG AGGG C AAGGGGCCC CCGCAGGGTA CTTCTGTGCT TGCCCTCGCC 1913 
CTGCCAGCAG GCAGCTGTGC CCCTGGCCTG CCCTTCCCGG GACCCCTTAT TCCAACTCAG 1 973 
CTCCTCTTTG CACTGGAATG GGGCACTCCA ACACCCCTCA GGGACCACCC TCCCCACAGT 2033 
ATGCACTCAG CCCCACAGAA CCCACCAGTC TTTCTGGGAA CTCACACCTG CCCGCCATCT 2093 
TGGTACTTTA GGTTAATCCC TCAAGCATGA AAGCTGGATC TTTTGGGGTT T AAGAAGCCC 2153 
AAGCCTTGTT CCTGCCCTGG CCTAGGGAGC ACTCAGGAGG GTTCCTTGGT CCTCATCTCT 22 1 3 
CCCACCTCCG TTCCCTCTGG GCCCCACACT AGCCACAGCG CGGGCCTTGT GCTGGAGTTT 2273 
GAGCCTGGGA CAGGGAGAGG GAGGCTTGGA GACAGTCTGA CCCAGTGCCC TCTAGGCCAC 2333 
CCACTTCTAG GCCTGCCCTG CCGCCGTGGA GCCCTGGGCA AGCTCTTTCC CCTTTCTGGG 2393 
CCTGGGTCTC CCCATCTCTT CAATGGGGCT GATACCTTCA CAGCCCACAG CATGGCACTT 2453 
ATGAGGACAA AGTGAATTTA ACCTGGAAAA GAATGTATTT GAGAGTTTCT TTTAAATAAT 25 1 3 
CAGCGGGTGT TGGTGATTTG TAGCCCTTCT GCCCTTAAAT GCTTCCTTGG GCAAGAGCTG 2573 
TCTGTCCTCC CTGCAGGAGG CTGAGTGTGA AGAGTATCAT TCATTGTTTC TTTATTAAAT 2633 
TATTTTCTCT A 2644 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gin Arg Asn Gly Ser Ala Ser Arg Gly Leu Glu Lys Thr Arg Leu 
1 5 10 15 

Arg Leu Cys Arg Glu Ala Arg He Pro Glu Ser Ala Phe Leu Thr Gly 
20 25 30 

Leu Thr Arg Glu Ser Trp Glu Ala Arg Cys Trp Cys Ala Lys Asp Thr 
35 40 45 

Arg Glu Val Val Ala lie Lys Cys Val Ala Lys Lys Ser Leu Asn Lys 
50 55 60 

Ala Ser Val Glu Asn Leu Leu Thr Glu He Glu He Leu Lys Gly He 
65 70 75 80 
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Arg His Pro His He Val Gin Leu Lys Asp Phe Gin Tip Asp Ser Asp 
85 90 95 

Asn lie Tyr Leu He Met Glu Phe Cys Ala Gly Gly Asp Leu Ser Arg 
100 105 110 

Phe He His Thr Arg Arg He Leu Pro Glu Lys Val Ala Arg Val Phe 
115 ~ " 120 125 

Met Gin Gin Leu Ala Ser Ala Leu Gin Phe Leu His Glu Arg Asn lie 
130 135 140 

Ser His Leu Asp Leu Lys Pro Gin Asn He Leu Leu Ser Ser Leu Glu 
145 * 150 155 160 

Lys Pro His Leu Lys Leu Ala Asp Phe Gly Phe Ala Gin His Met Ser 
165 170 175 

Pro Trp Asp Glu Lys His Val Leu Arg Gly Ser Pro Leu Tyr Met Ala 
180 185 190 

Pro Glu Met Val Cys Gin Arg Gin Tyr Asp Ala Arg Val Asp Leu Trp 
195 200 205 

Ser Met Gly Val He Leu Tyr Glu Ala Leu Phe Gly Gin Pro Pro Phe 
210 215 220 

Ala Ser Arg Ser Phe Ser Glu Leu Glu Glu Lys He Arg Ser Asn Arg 
225 230 235 240 

Val He Glu Leu Pro Leu Arg Pro Leu Leu Ser Arg Asp Cys Arg Asp 
245 250 255 

Leu Gin Gin Arg Leu Leu Glu Arg Asp Pro Ser Arg Arg lie Ser Phe 
260 265 270 

Gin Asp Phe Phe Ala His Pro Trp Val Asp Leu Glu His Met Pro Ser 
275 280 285 

Gly Glu Ser Leu Gly Arg Ala Thr Ala Leu Val Val Gin Ala Val Lys 
290 295 300 

Lys Asp Gin Glu Gly Asp Ser Ala Ala Ala Leu Ser Leu Tyr Cys Lys 
305 310 315 320 

Ala Leu Asp Phe Phe Val Pro Ala Leu His Tyr Glu Val Asp Ala Gin 

325 330 335 

Arg Lys Glu Ala He Lys Ala Lys Val Gly Gin Tyr Val Ser Arg Ala 
340 345 350 

Glu Glu Leu Lys Ala He Val Ser Ser Ser Asn Gin Ala Leu Leu Arg 
355 360 365 

Gin Gly Thr Ser Ala Arg Asp Leu Leu Arg Glu Met Ala Arg Asp Lys 
370 375 380 

Pro Arg Leu Leu Ala Ala Leu Glu Val Ala Ser Ala Ala Met Ala Lys 
385 390 395 400 
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Glu Glu Ala Ala Gly Gly Glu Gin Asp Ala Leu Asp Leu Tyr Gin His 
405 410 415 

Ser Leu Gly Glu Leu Leu Leu Leu Leu Ala Ala Glu Pro Pro Gly Arg 
420 425 430 

Arg Arg Glu Leu Leu His Thr Glu Val Gin Asn Leu Met Ala Arg Ala 
435 440 445 

Glu Tyr Leu Lys Glu Gin Met Arg Glu Ser Arg Trp Glu Ala Asp Thr 
450 455 460 

Leu Asp Lys Glu Gly Leu Ser Glu Ser Val Arg Ser Ser Cys Thr Leu 
465 470 475 4go 

Gin 
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